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(54) Translating apparatus and translating method 

(57) A translating apparatus and a translating 
method wherein a first language sentence is divided (4) 
into syntax units consisting of predetermined units of 
senterKe structure such as clauses and phrases in 
stages from large syntax units into small syntax units 
and at each stage stored (3.5) exanples most similar to 
these syntax units are detected (6) using probability 
(11) models taking into account grammatical attributes 
of the syntax units and of the examples and using gen- 
eralized linguistic knowledge arxj with reference to a 
thesaurus (12) and the syntax units are translated (7) on 
the basis of these detected examples and the results of 
translatk^n of the syntax units are compounded to gen- 
erate a second language sentence. The invention 
makes it possible to carry out high-quality translating 
efficiently and therefore at a high processing speed 
while eliminating the need for nnany superftdally differ- 
ent examples having the same meaning content to be 
prepared and the need for the thesaurus (12) to be reg- 
ularly structured or have uniform distances between 
concepts. 
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Description 

Background of the Invention 

1. Field of the Invention 

This invention relates to a translating apparatus and a translating method, and particularly to a translating appara- 
tus and a translating method for translating a first language sentence expressed in a first language into a second lan- 
guage sentence expressed in a second language using exanples. 

2. Description of the Related Art 

Translating apparatuses for translating a first language sentence expressed in a first language into a second lan- 
guage sentence expressed in a second language can be generally divided into the three classes: rule-driven translation 
apparatuses, example-using translation apparatuses, and example-driven translation apparatuses. 

Fig. 1 shows the construction of an exanple of a rule-drh/en translation apparatus. In Fig. i . an inputting part 1 con- 
sists of for example a keyboard or a voice recognition device or a character recognition device; a first language sen- 
tence is inputted into it and is converted into a form such as text data and outputted. That is. when the inputting part 1 
consists of a keyboard, a first language Sentence is inputted by the keyboard being operated and test data con-espond- 
ing to that operation is outputted. When the inputting part 1 is a voice recognition device, a first language sentence is 
inputted as voice and voice-recognized. Text data corresponding to the results of the voice recognition is then outputted. . 
When the inputting part 1 is a character recognition device, a first language sentence written for example on paper is 
inputted (read) and this first language sentence is character-recognized. Text data corresponding to the results of this 
character recognition is then outputted. 

The text data corresponding to the first language sentence outputted from the inputting part 1 is supplied to an ana- 
lyzing part 31. In the analyzing part 31, the first language sentence from the inputting part 1 is language-processed 
(analyzed) on the basis of knowledge and rules relating to the first language, and the results of this analysis are sup- 
plied to a converting part 32. The converting part 32 converts the analysis results from the analyzing part 31 into an 
intermediate language sentence of a prescribed intermediate language on the basis of knowledge and rules relating to 
the first language and a second language, and outputs this to a generating part 33. The generating part 33 generates 
from the intermediate language sentence supplied from the converting part 32 a corresponding second language serv 
teoce, that is, a translation (second language sentence) consisting of the first language sentence translated into the 
second language, on the basis of knowledge and rules relating to the second language. 

An outputting part 1 6 is made up of for example a display or a voice synthesizing device and a speaker or the like, 
and displays or outputs in a synthesized voice the second language sentence supplied from the generating part 33. 

Fig. 2 shows the construction of an example of an example-using translating apparatus. Parts in the figure con*e- 
sponding to parts in Fig. 1 have been given the same reference numerals and will not be described in the following. 
Apart from being provided with a collating part 41 and a replacing part 42. this example-using translating apparatus Is 
of the same construction as the rule-driven translation apparatus of Rg. 1. 

In this example-using translating apparatus, examples in sentence units expressed in a first language and conre- 
spondng translations consisting of the exanr^ples translated into a second language (hereinafter for convenience these 
examples and their corresponding translations will be called translation example data) are stored for exanple in the 
form of parameters, and in the collating part 41 the first language sentence outputted by the inputting part 1 is collated 
with the examples and any example matching the first language sentence is thereby detected. When there is an exam- 
ple which matches the first language sentence, the collating part 41 controls the replacing part 42 to replace the first 
language sentence with the translation corresponding to the example matching it. Accordingly, in the replacing part 42. 
the first language sentence is replaced with the example matching it and supplied to the outputting part 16. 

When on the other hand there is no exanple which matches the first language senterKe, the collating part 41 out- 
puts the first language sentence to the analyzing part 31. Thereafter, in the analyzing part 31, the converting part 32 
and the generating part 33, the same processing as in the case shown in Fig. 1 is earned out, and the second language 
sentence obtained as a result is supplied to the outputting part 16. 

Details of this kind of example-using translating apparatus are disdosed for example in Japanese Unexamined Pat- 
ent Publication No H,6-68134. 

Next, Fig. 3 shows the construction of an exanple of an exarrple-driven translating apparatus. Parts in Fig. 3 the 
same as parts in Fig. 1 have been given the same reference numerals and will not be desaibed in the following. 

In this example-driven translating apparatus, a first language sentence outputted by the inputting part 1 is supplied 
to a Converting part 51 , and when the converting part 51 receves the first language sentence from the inputting part 1 
it corTtrols a searching part 52 to search for the exanple most similar to that first language sentence. 

That is. translation example data is stored in sentence units in a translation exanple memory 53, and the searching 
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part 52 first refers to the trarislation example memory 53 and searches for an example which matches the first language 
sentence. When it finds an example which matches the first language sentence, the searching part 52 outputs this 
example and its corresponding translation to the converting part 51. In this case, the converting part 51 supplies the 
translation from the searching part 52 unchanged to the outputting part 16 as a second Ipnguage sentence. 

When on the other hand it cannot find an example which matches the first language sentence, the searching part 
52 successively reads the examples stored in the translation example memory 53 and supplies them to a similarity, 
degree calculating part 54. The searching part 52 makes the similarity degree calculating part 54 calculate a similarity 
degree expressing the conceptual similarity (the similarity in meaning) between each of the examples and the first lan- 
guage sentence using external knowledge such as lor example a thesaurus. 

That is. a thesaurus wherein words are classified on the basis of their concepts in a tree structure is stored in a 
thesaurus memory part 55. In the thesaurus, nodes of the tree structure are equivalent to meaning concepts and so- 
called leaf parts are equivalent to words. Referring to this thesaurus, the similarity degree calculating part 54 calculates 
a degree of similarity between the first language sentence and the exarrples on the basis of the classes to which con- 
cepts common to wads constituting the first language sentence and words constituting the examples belong. The 
searching part 52 then finds in the translation example memory 53 the example of which the similarity degree calcu- 
lated by the sirnilarity degree calculating part 54 Is the highest and supplies the translation corresponding to that exam- 
ple to the converting part 51. 

When the converting part 51 receives the translation from the searching part 52. it replaces those words of the 
translation which do not match (correspond with) words of the first language sentence with translations of those words 
and outputs this to the outputting part 16 as a second language sentence. 

Details of this kind of example-driven translating apparatus are disclosed for exanple in Japanese Unexamined 
Patent Publication No H.3-276367. Also, details of methods of calculating the degree of similarity between a first lan- 
guage sentence and an example are also disclosed in for example Japanese Unexamined Patent Publication No. H.4- 
188276. Japanese Unexamined Patent Publication No. H.6-274546 and Japanese Unexamined Paterrt Publication No. 
H.6-274548 as well as the aforementioned Japanese Unexamined Patent Publication No. H.3-276367. 
However, in the kinds of translating apparatus described above, there have been the following problems. 
That is. in rule-driven translation apparatuses there has been the problem that information required for translation 
is sometimes lost in the process of analysis of the first language sentence, and consequentiy results of translation of 
greetings and other set phrases have become word-for-word in tone. Also, when the first language sentence Includes 
for example provertw or other idiomatic expressions (idioms), translating these correctiy has been difficult Also, build- 
ing and maintaining as a data base the knowledge and rules relating to the first language and the second language 
used in rule-driven translation apparatuses has not been easy. 

With exarrple-using translating apparatuses, on the other hand, because the first language sentence is translated 
using a translation corresponding to an example matching it. if greetings and other set phrases and kJioms and so on 
are stored as examples, it is possible to obtain a second language sentence which is a natural expression. However, in 
an exarrple-using translating apparatus, when no example matching the first language sentence is stored, because 
processing similar to that of a rule-driven translation apparatus is carried out after all. the kinds of problem mentioned 
above arise, and furthermore storing examples completely matching the character strings of ail first language sen- 
tences that might be inputted has been problematic. 

In an example-driven translating apparatus, because translation is carried out using examples whatever the nature 
of the first language sentence, the problems of the rule-driven translation apparatus do not arise, and also it is not nec- 
essary to store examples completely matching all possible first language sentences. 

However, in an example-driven translating apparatus, when there is no example matching the first language sen- 
tence, the exanrple most similar to it is found and those of the words constituting the corresponding translation whk:h 
do not match words constituting the first language sentence are simply replaced with translations of those words and 
this becomes the translation result, i.e. the second language sentence. Therefore, in an example-driven translating 
apparatus, linguistic knowledge and rules are not reflected in a generalized form and consequently the quality of the 
translation (the accuracy of the translatbn and the naturalness of the translation result) can only be increased by 
increasing the number of examples stored in the translation example memory 53 (whereas the quality of the translations 
of a rule-driven translation apparatus can also be improved through the way in which knowledge and rules relating to 
the languages are described). Increasing the nunnber of exanples stored in the translation example memory 53 
increases the time required for the processing of searching for an example matching or similar to the first language sen- 
tence. 

The calculation of the similarity degree D (I.E) between a first language sentence I and an example phrase E in an 
example-driven translating apparatus, when the words constituting the first language sentence I are expressed i^. ie, 
• • • , it and the words constituting the example phrase E are expressed e^. e2. • • • , e, (where t represents the number 
of words constituting the first language sentence and the example respectively) has been carried out as shown in Exp. 
(1 ) by finding the distance in meaning (the conceptual distance) word<jistance(ik,eK) between each word \^ constituting 
the first language sentence I and the conresponding word % in the example phrase E. assigning to it a weighty corre- 
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spending to the importance of the word e^. and obtaining the sum total of these weighted distances. 



D{l,E)=]^word<listance(i^.e Jxweight^ (1) 

With this kind of similarity degree calculating method there have been the following kinds of problem. 

That is. for example when the meaning content of a certain example is spoken, although sometimes the example 
itself is spoken, in most cases a word such as the word constituting the subject in the example is omitted or extra words 
are inserted. With the calculation method described above, because it is presupposed that the first language sentence 
and the example are made up of the same number of words, when the ruimbers of words in the two are different, it has 
been unclear whether or not it is possible to obtain a similarity degree con^ectly reflecting the similarity of the example 
to the first language sentence. 

Also, reversely, even if the first language sentence and the exarrple are made up of the same number of words, 
when their word orders are different, again it has been unclear whether or not it is possible to obtain a similarity degree 
correctly reflecting the similarity of the example to the first language sentence. That is. if the first language is a language 
wherein words can be lined up in a relatively free order like for example Japanese, when for example 'RINGO O 
WATASHI WA TABETAI' {'an apple - I want to eaf) is inputted as the first language sentence, even if for example 
•WATASHI WA f^lKAN O TABETAI' ('I want to eat an orange*) is stored as an example, because the 'RINGO' ('apple') 
and the WATASHI' (T) of the first language sentence respectively correspond to the mTASHI' (T) and the 'MIKAN* 
('orange') of the example, recognizing that the two sentences are similar has been problematic. 

Also, when for example the inputting part 1 consists of a voice recognition device, it sometimes happens that the 
speaker does not speak cleariy and conrect recognition is not achieved in the voice recognition device and text data 
wherein particles and the like are missing is outputted. That is. it sometimes happens for example that although the first 
language sentence intended by the speaker was 'WATASHI WA. ANATA NO I^IKAN GA TABETAI' ('I want to eat your 
orange ) the output of the inputting part 1 becomes -WATASHI. ANATA. MIKAN. TABETAI" (1. you. orange, want to eat'). 
In this kind of case, when for example 'WATASHI WA MIKAN O TABETAI" ('I want to eat an orange) is staed as an 
example, the similarity degree should be calculated with the compound noun 'ANATA NO MIKAN' ('your orange'), con- 
sisting of the 'ANATA* and the 'MIKAN' In the output of the inputting part 1 joined together, being made to correspond 
with the "MIKAN' ('orange') in the example, but with a method using Exp. (1) this has been difficult. 

To handle an Input whose meaning content matches that of an exarrple but which differs from the exanple as It 
were superficially, as described above, it is necessary to add exanples corresponding to these superficial variations to 
the translation exanple memory 53. and as mentioned above this Increases the time required for the searching proc- 
ess. 

Also, the distance word-distanceiik.ej between the word and the word ek has been determined on the basis of 
the dass to which a concept common to the word and the word e^ (the concept which belongs to the kDwest dass 
among the concepts including the word i^ and the word eO belongs In a thesaurus stored in the thesaurus memory part 
55. That is. the thesaurus is constructed by for exanple making the class to which the largest concepts belong the 0th 
class and then classifying smaller concepts Into 1st through 3rd progressively lower classes and classifying words by 
assigning them to the concepts in the 3rd dass to which they correspond, and with reference to this kind of thesaurus 
it is detected which of the 0th through 3rd dasses the lowest concept common to the word and the word e^ belongs 
(hereinafter for convenience the C in Cth class will be referred to as the level). Then, according to whether that concept 
belongs to the 0th. the 1st. the 2nd or the 3rd class, the distance word-distanceOk.eiJ Is determined to be 0. 1/3, 2/3 or 
1 respectively. 

In this case, it is necessary that when the higher classes are seen from the words there are the same nurTt)er of 
dasses (in the case descrtoed above, the four dasses of 0th through 3rd) above each word, and therefore the thesau- 
rus as a whole has had to have an as it were regular structure. 

Also, because the cSstance word<iistance(iK.ek) between two words is determined on the basis of the level of the 
dass to which the common concept of the two words belongs (because it is determined on the basis of which of the 0th 
through 3rd classes the class to which concept belongs Is), the thesaurus must be so constructed that the distance 
between two words having a concept belonging to a certain dass as their common concept and the distance between 
two words having a different concept belonging to the same class as their common concept is the same. 

Fig. 1 6 shows an example of a conventional thesaurus wherein concepts of words are classified Into three dasses, 
0th through 2nd. In the figure, the rectangles represent concepts and the cirdes represent words. 

In this kind of thesaurus, the concept Cl common to the words Wa and Wb Is different from the concept C2 com- 
mon to the words Wc and Wd. but since the concepts Cl and C2 both belong to the 1 st dass. according to the method 
of calculating the distance word-distance(ik.eO of the related art described above, the distance between the words Wa 
and Wb and the distance between the words Wc and Wd are the same. This means that the distance in meaning 
between concepts connected by lines is the same between any two concepts, and therefore it has been necessary for 
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the thesaurus to be so constructed that this is the case. 

However, constructing a thesaurus regularly and also so that the distance between concepts is the same, as 
desaibed above, although it may be possible with a small-iscale thesaurus, is very difficult in a thesaurus of any sub- 
stantial sire. That is. if the thesaurus is constructed regularly it is difficult to make the distance between concepts fixed, 
and if the thesaurus is constructed so that the distance between concepts is fixed it becomes difficult to maintain its reg- 
ularity. 

Therefore, a way of making possible high<^uality translation using a non-regular thesaurus or a thesaurus wherein 
the distance between concepts is not fixed has been being sought. 

Summary of the Invention 

It is therefore an object of the present Invention to overcome the problems described above and provide a translat- 
ing apparatus and a translating method with which it is possible to carry cut high-quality translation easily. 

To achieve the aforementioned object and other objects, the invention provides a translating apparatus for translat- 
ing a first language sentence expressed in a first language into a second language sentence expressed in a second 
language comprising dividing means for dividing the first language sentence into syntax units consisting of predeter- 
mined units of sentence structure, detecting means for detecting examples similar to the syntax units of the first lan- 
guage sentence outputted from the dividing means, translating means for translating the first language sentence in 
syntax units on the basis of translations corresponding to the examples detected by the detecting means, and generat- 
rng means for generating a second language sentence by compounding the results of translation of the syntax units 
obtained by the translating means. 

The invention also provides a translating method comprising a step of dividing a first language sentence into syntax 
units consisting of predetermined units of sentence structure, a step of detecting exanples similar to the syntax units 
of the first language sentence, a step of translating the first language sentence in syntax units on the basis of transla- 
tions corresponding to the detected examples, and a step of generating a second language sentence by corrpomding 
the results of translation of those syntax units. 

The invention also provides a translating method comprising a step of dividing a first language sentence into syntax 
units consisting of predetermined units of senterKe structure in stages from large syntax units to small syntax units and 
collating the syntax units obtained at each stage and examples, a step of detecting examples corresponding to the syn- 
tax units on the basis of results of the collalion. a step of translating the first language sentence in syntax units on the 
basis of translations con-esponding to those detected examples, and a step of generating a second language sentence 
by compounding the results of translation of those syntax units. 

The invention also provides a translating apparatus comprising calculating means for calculating a word similarity 
degree expressing the similarity between a word constituting a first language sentence and a word constituting an 
example on the basis of the number of words included in a concept common to the word constituting the first language 
sentence and the word constituting the example in a thesaurus wherein words are classified in a tree structure on the 
basis of their concepts. 

The invention also provides a translating method comprising a step of calculating a word similarity degree express- 
ing the similarity between a word constituting a first language sentence and a word constituting an exanple on the basis 
of the number of words included in a concept common to the word constituting the first language sentence and the word 
constituting the example in a thesaurus wherein words are classified in a tree structure on the basis of their concepts. 

The invention also provides a translating apparatus comprising detecting means for detecting an example similar 
to a first language sentence on the basis of a priori probability of the example being used for translating the first lan- 
guage sentence and a transformation probability of the example being transformed into the first language sentence and 
used and translating means for translating the first language sentence on the basis of a translation corresponding to the 
example detected by the detecting means. 

The invention also provides a translating method comprising a step of detecting an example similar to a first lan- 
guage sentence on the basis of a priori probability of the example being used for translating the first language sentence 
and a transfomnation probability of the example being transfomied into the first language sentence and used and a step 
of translating the first language sentence on the basis of a translation corresponding to that detected exanple. 

The invention also provides a translating apparatus conrprising extracting means for by performing syntax analysis 
on a first language sentence extracting grammatical attributes of the first language sentence and detecting means for 
detecting an example similar to the first language sentence taking into account grammatical attributes of the first lan- 
guage sentence and of the exanple and translating means for translating the first language sentence on the basis of a 
translation corresponding to the exanple detected by the detecting means. 

The invention also provides a translating method conprising a step of t>y performing syntax analysis of a first lan- 
guage sentence extracting grammatical attributes of the first language sentence and a step of detecting an example 
similar to the first language sentence taking into account grammatical attributes of the first language sentence and of 
the example and a step of translating the first language sentence on the basis of a translation corresponding to the 
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example detected by the detecting means. 

Brief Description of the Drawings 

Fig. 1 is a block diagram showing the construction of an example of a translating apparatus of the related art; 
Fig. 2 is a block diagram showing the construction of another example of a translating apparatus of the related art; 
Fig. 3 is a block diagram showing the construction of another example of a translating apparatus of the related ar*; 
Fig. 4 is a view showing a thesaurus used in a translating apparatus of the related art; 

Fig. 5 is a block diagram showing an example of the construction of a preferred embodiment of a translating appa- 
ratus according to the invention; 

Fig. 6 is a view showing a thesaurus stored in a thesaurus memory 12 of Fig. 5; 

Fig. 7 is a flow chart illustrating the operation of the translating apparatus of Fig. 5; 

Figs. 8A and 8B are further views illustrating the operation of the translating apparatus of Fig. b; 

Figs. 9A through 9C are further views illustrating the operation of the translating apparatus of Fig. 5; 

Figs. lOA through IOC are further views illustrating the operation of the translating apparatus of Fig. 5; 

Fig. 1 1 is a flow chart illustrating a method for obtaining a priori probability P{Ej); 

Fig. 12 is a flow chart illustrating processing carried out by a converting part 7 shown in Fig. 5; 

Fig. 13 is a flow chart illustrating details of the processing of a step S32 in Fig. 12; 

Fig. 1 4 is another flow chart illustrating details of the processing of the step S32 in Fig. 1 2; 

Fig. 15 is a block diagram showing an example of the construction of another preferred embodiment of a translating 

apparatus according to the invention; and 

Fig, 16 is a flow chart illustrating processing carried out in the converting part 7 of Fig. 15 
Detailed D escription of the Preferred Embodiments 

Fig. 5 shows the construction of a preferred embodiment of a translating apparatus according to the invention. 
Parts in Fig. 5 corresponding to parts shown in Fig. 1 have been given the same reference numerals and will not be 
desCTibed In the following. This translating apparatus, as will be further discussed later, divides a first language sen- 
tence into syntax units consisting of predeternraned units of sentence structure, in stages, first into large syrrtax units 
and then into small syntax units, and translates the syntax units obtained at each stage using examples, and in this 
sense it can be called a multi-stage example-using translating apparatus. Here, as the first language sentence for 
exanple a Japanese sentence is inputted, and as a second language sentence constituting the result of translation of 
the first language sentence for example an English sentence is outputted. 

An inputting part 1 of this translating apparatus, like the inputting part mentioned above, consists of for exarrple a 
keyboard or a voice recognition device or a character recognition device, and outputs in the form of text data or the like 
a first language sentence inputted into it. Here, the inputting part 1 detects the beginning and end of each woitJ (or mor- 
pheme) constituting the first language sentence, and its part of speech, root and Chinese character reading and so on 
(hereinafter, this information obtained by morpheme analysis will for convenience be called morpheme information), 
and outputs these in the farm of text data together with the first language sentence. 

The output of the inputting part 1 is supplied to a collating part 2. The collating part 2 collates examples staed in 
a set phrase translation example memory 3 with the first language sentence from the inputting part 1 and determines 
whether or not there is among the examples stored in the set phrase translation example memory 3 one which matches 
the first language sentence. When an example which matches the first language sentence from the inputting part 1 is 
stored in the set phrase translation example menwy 3, the collating part 2 reads from the set phrase translation exam- 
ple memory 3 a translation corresponding to this example and supplies it to an outputting part 16. Thus, in this case, 
processing is not carried out in subsequent stages of a clause dividing part 4 through to a clause compounding part 15. 
The translation from the collating part 2 is outputted by the outputting part 16 as the second language sentence. 

When no example matching the first language sentence from the inputting part 1 is stored in the set phrase trans- 
lation example memory 3. the collating part 2 outputs the first language sentence from the inputting part 1 and its mor- 
pheme information to a clause dividing part 4. 

The set phrase translation example memory 3 stores translation example data made up of examples of for instance 
greetings and other so^alled set phrases expressed in the first language (here, as mentioned above, Japanese) and 
con-e^nding translations consisting of these examples translated into the second language (here, as mentioned 
above, English). Therefore, when the first language sentence is a set phrase (a se* phrase stored a? rn example in the 
set phrase translation exanrtple memory 3), the translation conresponding to the exanple matching that set phrase is 
supplied from the collating part 2 to the outputting part 1 6, and when the first language sentence is not a set phrase this 
first language sentence and its morpheme information is supplied from the collating part 2 to the clause dividing part 4. 

When it receives the first language sentence and the mapheme information from the collating part 2, the clause 
dividing part 4 determines whether or not there is in the first language sentence a clause matching any of the examples 
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stored in a clause division pattern translation example memory 5. and when there is such a clause in the first language 
sentence cuts the first language sentence at this clause and thereby divides the first language sentence into clauses, 
or large syntax units. The clause dividing part 4 supplies parts of the result (clauses) of the division of the first language 
sentence other than the clause which matches one of the examples stored in the clause division pattern translation 
5 example memory 5 to a simple syntax analyzing processing part 6 together with the morpheme intormation. Also, the 
clause dividing part 4 reads from the clause division pattern translation example memory 5 the translation correspond- 
ing to the example matching the clause used to divide up the first language sentence and supplies this together with 
information relating to how the first language sentence was divided up (hereinafter referred to for convenience as clause 
division information) to the clause compounding part 15. 
10 When on the other hand there is in the first language sentence no clause matching an example stored in the clause 
division pattern translation example mennory 5. that is, when the first language sentence consists of a single clause, the 
clause dividing pan 4 supplies the first language sentence unchanged to the simple syntax analyzing processing part 
6 together with its morpheme information. 

The clause division pattern translation example memory 5 stores translation example data made up of exanples in 
/5 clause units expressed in the first language and corresponding translations consisting of these examples translated into 
the second language. Here, examples of so-called patterned clauses such as for example DEKITARA' ('if possible') are 
stored in the clause division pattern translation example memory 5. 

The simple syntax analyzing processing part 6. by carrying out so-called shallow syntax analysis on a dause con- 
stituting the first language sentence from the dause dividing part 4 on the basis of the morpheme information also from 
20 the dause dividing part 4, for example using language-governing rules (knowledge) described as regular grarrvnar. 
divides the dause (when a plurality of dauses are supplied from the clause dividing part 4. each of the clauses) into 
phrase units. That is. here, the simple syntax analyzing processing part 6 divides up the first language sentence dause 
from the clause dividing part 4 into for example noun phrases, verb phrases and other phrases (for example adverb 
phrases and sentence modifiers and so on), and supplies these to a converting part 7. 
25 The simple syntax analyzing processing part 6. by carrying out syntax analysis, also carries out extraction of gram- 
matical attributes for each phrase, such as for example syntactic head (the main word of the phrase (for example the 
important part of a noun phrase (a modified word or the like) or when a noun phrase consists of a single noun, that 
noun) and modality (for example information relating to potential, conditional, desiderative and so on). Also, the simple 
syntax analyzing processing part 6 detects linkage relationships between phrases constituting the dause, and other 
30 syntax infomnation (hereinafter called clause syntax Information). The grammatical attrtoutes and clause syntax infor- 
mation are supplied to the converting part 7 together with the phrases obtained as described above. 

The converting part 7 detects phrase unit examples most similar to the phrases of the first language sentence from 
the simple syntax analyzing processing part 6 from a noun phrase translation example memory 8. a verb phrase trans- 
lation exarrple memory 9 and an other translation example memory 10. and carries out translation in phrase units on 
35 the basis of translations corresponding to these examples. 

That is. the converting part 7 controls a similarity degree calculating part 1 1 to calculate word similarity degrees 
expressing the similarity in meaning between the words in the examples stored in the noun phrase translation example 
memory 8. the verb phrase translation example memory 9 or the other translation example memory 10 and the words 
constituting the phrases from the simple syntax analyzing processing part 6. and on the basis of these word siiralarity 
40 degrees and other parameters (for exannple the grammatical attrtoutes outputted by the simple syntax analyzing 
processing part 6) detects the examples most similar to the phrases from the simple syntax analyzing processing part 
6. In the converting part 7. when an example matehing a phrase from the simple syntax analyzing processing part 6 is 
stored in the noun phrase translation example memory 8. the verb phrase translation example memory 9 or the other 
translation example memory 10. this exanple is detected as the example most similar to the phrase from the simple 
45 syntax analyzing processing part 6. 

Also, by refemng to a translation dictionary 13, the converting part 7 replaces those of the words constituting the 
translatwn corresponding to the most similar example thus detected which do not correspond to words constituting the 
first language sentence with translations of those words, and outputs translations of phrases obtained In this way to a 
phrase corrpounding part 14 together with the dause syntax infamation from the simple syntax analyzing processing 
50 part 6. 

The converting part 7 carries out the processing described above for each noun phrase, verb phrase and other 
phrase sipplied from the simple syntax analyzing processing part 6. Detection of the phrase unit example most similar 
to a noun phrase, a verb phrase or an other phrase is can-ied out with reference respectively only to the noun phrase 
translation example memory 8. the verb phrase translation exarrple memory 9 or the other translation exanple memory 
55 10. 

The noun phrase translation example menriory 8, the verb phrase translation example memory 9 and the other 
translation example memory 10 respectively store examples in noun phrase units, verb phrase units and other phrase 
units. 

The similarity degree calculating part 11. under the control of the converting part 7. on the basis of a thesaurus 
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stored in a thesaurus memory 12. calculates degrees of similarity between the words constituting the examples stored 
in the noun phrase translation example memory 8. the verb phrase translation example memory 9 or the other transla- 
tion example memory 10 and the words constituting the phrases outputted by the simple syntax analyzing processing 
part 6. The thesaurus memory 12 stores a thesaurus wherein words are classified in a tree structure on the basis of 
their concepts. The translation dictionary 13 stores words of the first language in association with corresponding words 
of the second language (words of the first language translated into the second language (translations)). 

The phrase compounding part 14. by compounding the translations of noun phrases, verb phrases and other 
phrases supplied from the converting part 7 on the basis of the clause syntax information also supplied from the con- 
verting part 7 and language-governing rules of the second language, produces clauses of the second language and 
supplies them to the clause compounding part 15. The clause compounding part 15 confounds the clauses and 
thereby produces a second language sentence constituting the result of translation of the first language sentence. 

That is. when the first language sentence is made up of a plurality of clauses, results of translation of a plurality of 
clauses are supplied to the clause compounding part 15 from the phrase compounding part M. Also, a result of trans- 
lation of a clause may be supplied to the clause compounding part 15 not only from the phrase compounding part U 
but also from the clause dividing part 4. By compounding (combining) the results of translation of a plurality of clauses 
supplied to it in this way the clause compounding part 15 produces a second language sentence. The second language 
sentence produced by the clause compounding part 15 is supplied to the outputting part 16 and outputted. 

From the clause dividing part 4, as mentioned above, besides a result of translation (a translation) of a clause, 
clause division information is also supplied to the clause compounding part 15. and the translations of clauses output- 
ted by the phrase compounding part 1 4 are compounded with the translation of the dause outputted by the clause divid- 
ing part 4 on the basis of this clause division information. 

Next, with reference to Fig. 6. the thesaurus stored in the thesaurus menx)ry 12 will be described. In the thesaurus 
memay 12 is stored a thesaurus wherein words are classified in the same way as in for example the EDR (Japan Elec- 
tronic Dictionary Research Institute) concept dictionary, and Fig. 6 shows a part of this thesaurus (Source: 'Japan Elec- 
tronic Dictionary Research Institute. EDR Electronic Dictionary Product Edition (1.0th Edition), 1995'). That is. Fig. 6 
shows those words (enclosed by ellipses in the figure) in the thesaurus stored in the thesaurus memory 12 which can 
be expressed as go" (in the figure not the words themselves but rather their senses are shown), and their higher con- 
cepts (enclosed by rectangles in the figure). 

This thesaurus is not so constructed that when the classes above are seen from the words there are the same 
nun*ef of dasses above each word. That is. when for exanple as in the case shown in Fig. 4 the dass to which the 
highest concept belongs is made the 0th dass and the dasses below it are successively called the 1st dass, the 2nd 
dass. and so on. the word W1 having the meaning (sense) time passes* belongs to the 3rd<Iass concept of 'the pass- 
ing of tinr^e" and the word W2 having the meaning *do gradually' belongs to the 7th-class concept of leave the place one 
has been until now\ Also, in this thesaurus, for exarrple the word W3 having the meaning animal/person dies' and the 
word W4 having the meaning 'arrive at destination* are induded in a plurality of concepts in the same class. That is. the 
word W3 is included in both of the 4th-class concepts of 'physiological phenomena of animals' and 'life going ouV, and 
the word W4 is included in both of the 7th<lass concepts of 'subject moves to end point' and arrival*. 

Thus, this thesaurus is not constructed regularly 

Also, this thesaurus is not so constructed that the distances between concepts joined by lines are the same. That 
is, for example the concept Cl ('subject moves to end point") is connected to both the concepts C2 ('movement of sub- 
ject") and C3 ('arrival at end poinf). but the distance between the concepts Cl and C2 cannot be said to be the same 
as the distance between the concepts Cl and C3. 

In the similarity degree calculating part 1 1 of Fig. 4. as will be further discussed later, word similarity degrees accu- 
rately reflecting similarities between words are calculated notwithstanding that the thesaurus is constructed in the way 
desaibed above. 

Next, with reference to the flow chart of Fig. 7. the operation of the translating apparatus shown in Fig. 5 will be 
described. When a first language sentence is inputted Into the inputting part 1 . in a step SO, the inputting part 1 converts 
this first language sentence into text data and generates morpheme information of the first language sentence and sup- 
plies these to the collating part 2. In a step Si , the collating part 2 collates the examples stored in the set phrase trans- 
lation example memory 3 with the first language sentence from the inputting part 1 and thereby determines whether or 
not there is an example stored in the set phrase translation exanrple memory 3 which matches the first language sen- 
tence from the inputting part 1 . 

When in the step Si it is determined that there is an example matching the first language sentence from the input- 
ting part 1 stored in the set phrase translation example menrxjry 3. processing proceeds to a step S2 wherein the first 
language sentence is replaced with the translation corresponding to this matching exarrple and then proceeds to a step 
S13. In the step S13. the translation obtained in the step Si is supplied from the Donating part 2 to the outputting part 
1 6. which then outputs a second language sentence consisting of the first language sentence translated into the second 
language, and processing ends. 

When on the other hand it is determined in the step Si that there is no example stored in the set phrase translation 
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example memory 3 which matches the first language sentence from the inputting part 1 , this first language sentence is 
supplied together with its morpheme information from the collating part 2 to the clause dividing part 4. Then, in the 
clause dividing part 4. in a step S3, it is determined whether or not there is among the clauses constituting the first lan- 
guage sentence a clause which matches one of the clause unit examples stored in the clause pattern translation exam- 
ple memory 5. 

When in the step S3 it is determined that among the clauses constituting the first language sentence there is one 
which matches a clause unit example stored in the clause pattern translation example memory 5. processing proceeds 
to a step S4 and the clause dividing part 4 cuts the first language sentence at that clause, thereby dividing the first lan- 
guage sentence into clause units, and generates dause division information. The clause dividing part 4 then reads from 
the clause pattern translation example mennory 5 the translation of the clause among the clauses constituting the first 
language sentence which matches a clause unit example stored in the clause pattern translation example memory 5, 
and outputs this translation to the clause compounding part 15 together with the dause division information. Also, the 
clause dividing part 4 supplies the remaining clauses constituting the first language sentence to the simple syntax ana- 
lyzing part 6 together with the morpheme information, and processing then proceeds to a step S5. 

When on the other hand it is determined in the step S3 that there is among the clauses constituting the first lan- 
guage sentence no dause which matches a dause unit example stored in the clause pattern translation exarrple mem- 
ory 5. that is. when for example the first language sentence is made up of a single dause. the dause dividing part 4 
outputs the first language sentence to the simple syntax analyzing part 6 together with the morpheme information and 
processing skips the step S4 and proceeds to the step S5. 

In the step S5. in the simple syntax analyzing part 6. syntax analysis is carried out on dauses constituting the first 
language sentence from the dause dividing part 4 on the basis of morpheme information also from the clause dividing 
part 4 and using language-governing rules (knowledge) of the first language, and as a result of this analysis the dauses 
of tfie first language sentence are divided into noun phrases, verb phrases and other phrases, and grammatical 
attributes of each phrase and dause syntax information such as linkage relationships between phrases constituting the 
clauses are generated. The phrases, the grammatical attributes and the dause syntax information are then supplied 
from the simple syntax analyzing part 6 to the converting part 7 and processing proceeds to a step S6. 

In the step S6, in the converting part 7, each of the phrases supplied from the sinple syntax analyzing part 6 and 
the stored content of the noun phrase translation example memory 8, the verb phrase translation example memory 9 
or the other translation example memory 10 are collated and it is thereby determined whether or not a phrase unit 
example matching a phrase supplied from the simple syntax analyzing part 6 is stored in the noun phrase translation 
example memory 8. the verb phrase translation exanple memory 9 or the other translation example memory 10. When 
in the step S6 it is determined that an exanrple matching a phrase supplied from the simple syntax analyzing part 6 is 
stored in the noun phrase translation example memory 8, the verb phrase translation example memory 9 or the other 
translation example memory 10. processing proceeds to a step S7 and the converting part 7 reads a translation corre- 
sponding to this matching example from the noun phrase translation example menx)ry 8. the verb phrase translation 
example memory 9 or the other translation example mennory 10 and outputs it to the phrase compounding part 14. and 
processing then proceeds to a step SIO. 

Here, noun phrases, verb phrases and other phrases for example obtained from the simple syntax analyzing part 
6 by sentence unit examples being inputted into the inputting part 1 are respectively stored in the noun phrase transla- 
tion example memory 8. the verb phrase translation example vnewory 9 and the other translation exampie memory 10. 
Also, the noun phrases, verb phrases and other phrases obtained in this are stored in association with con-esponding 
grammatical attributes. 

That is, phrase unit exanrples are stored in the noun phrase translation example memory 8, the verb phrase trans- 
lation example menwry 9 and the other translation example memory 10 together with as it were abstract attrixjtes 
thereof obtained as a result of syntax analysis, called grammatical attrtoutes. Therefore, the converting part 7 can be 
given generalized linguistic knowledge and nnade to perform collation of phrases from the simple syntax analyzing part 
6 (hereinafter for convenience called input phrases) with the phrases stored in the noun phrase translation example 
memory 8, the verb phrase translation example menwy 9 or the other translation exanple memory 10 {hereinafter for 
convenience called example phrases) taking into consideration grammatical attributes of the phrases (grammatical 
attrtoutes of the input phrases and of the example phrases), and effident processing becomes possible as a result. 

Specifically, when for example the verb phrase YOYAKU 0 SHITE ITADAKITAI NO DESU GA* ('I'd like you to make 
a reservation*) has been inputted into the simple syntax analyzing part 6, as grammatical attributes (or 'slot-name val- 
ues') for example (gerund-suffix ITADAKU) (desiderative +) (extended -predicate +) {s-pan GA) and (formal +) are 
obtained. Here, the most condse expression (sis it were, the expression equivalent to a root) of the verb phrase 
YOYAKU O SHITE ITADAKITAI NO DESU GA' is *VOYAKU O SURU' ('to make a reservation'), and thp slot-name va'ue 
(gerund-suffix ITADAKU) means that 'ITADAKU' has been added like a suffix to this YOYAKU O SURU\ the slot-name 
value (desiderative +) means that an expression expressing desire (here. TAI') has been added, the stot-name value 
(extended-predicate -*•) means that an expression for adding a predicate (here. 'NO DESU') has been added, the slot- 
name value (s-part GA) means that an ending partide (here, 'GA*) has been added, and the slot-name value (formal +) 
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means that the sentence ending is polite (here, that 'NO DESU has been used as the sentence ending and not for 
example the less polite 'NO DA'). 

In this case, as generalized linguistic knowledge, for example the knowledge that the NO' of the expression corre- 
sponding to the slot-name value (extended -predicate +) sometimes becomes "N' is given to the simple syntax analyzing 
part 6. and when for example 'YOYAKU O SHITE ITADAKITAI N DESU GA' (also meaning Td like you to make a reser- 
vation), wherein the NO' in the vert) phrase mentioned above has become "N*. is inputted as an input phrase, again the 
same slot-name values as in the case above of (g*^rund-suffix ITADAKU) (desiderative +) (extended-predicate +) (s-part 
GA) and (formal +) are outputted. 

Therefore, if the verb phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA' is stored together with its slot-name 
values as an example phrase in the verb phrase translation example memory 9. in the converting part 7 the example 
phrase YOYAKU O SHITE ITADAKITAI NO DESU GA* is determined to match the input phrase ^OYAKU O SHITE 
ITADAKITAI N DESU GA', 

Also, when ^YAKU O SHITE ITADAKITAI GA* (still meaning Td like you to make a reservation) consisting of the 
first-mentioned verb phrase ^YOYAKU O SHITE ITADAKITAI NO DESU GA* with the expressions corresporxling to the 
slot-name values (extended-predicale +) and (formal +) omitted is inputted as an input phrase, the slot-name values 
(gerund-suffix ITADAKU) (desiderative +) (s-part GA) are outputted by the sirrple syntax analyzing part 6. 

Now. let it be supposed that for example the knowledge that expressions corresponding to the slot-name values 
(extended-predicate +) and (formal +) can be ignored has been given to the converting part 7 as generalized linguistic 
knowledge. Then, assuming that the first-mentioned verb phrase YOYAKU O SHITE ITADAKITAI NO DESU GA' is 
stored together with its slot-name values as an example phrase in the verb phrase translation example memory 9. in 
the converting part 7 collation is carried out ignoring the (extended-predicate +) and (formal +) among the slot-name 
values of the example phrase 'YOYAKU O SHITE ITADAKITAI NO DESU GA*. and as a result the exarrple phrase 
YOYAKU 0 SHITE ITADAKITAI NO DESU GA* is determined to match the input phrase 'YOYAKU O SHITE ITADAKITAI 
GA'. 

In this way. even if an input phrase does not match an exarrple phrase completely it is possible to obtain an exam- 
ple phrase expressing the same meaning as the input phrase. That is. by taking grammatical attributes into considera- 
tion. rt is possible to carry out processing in the converting part 7 Ignoring the 'NO DESU' in the example phrase. 

Also, as a result of this, it becomes unnecessary to provide example phrases corresponding to supertkaal varia- 
twns: that is. it becomes unnecessary to store in the verb phrase translation example memory 9 as exarrple phrases in 
addition to the verb phrase YOYAKU O SHITE ITADAKITAI NO DESU GA* the verb phrase YOYAKU O SHITE ITADAK- 
ITAI GA' with the 'NO DESU' removed, the verb phrase YOYAKU O SHITE ITADAKITAI DESU GA* with the 'NO* 
removed or the verb phrase YOYAKU O SHITE ITADAKITAI 'N DESU GA' wherein the NO* has been changed to **tM*. 
and therefore example phrases can be stored in the noun phrase translaHon example memory 8. the verb phrase trans- 
lation exarrple memory 9 and the other translation exanple menrory 10 effkaently Also, the noun phrase translation 
example memory 8. the verb phrase translation example memory 9 and the other translation example memory 10 can 
be called a data base of example phrases, and when the approach described above is adopted it becomes possible to 
carry out the production, maintenance and improvement of this data base easily. 

When on the other hand It is determined in the step S6 that no example phrase in the noun phrase translation 
example memory 8. the verb phrase translation example memory 9 or the other translation exanple memory 10 
matches an input phrase (a phrase supplied from the simple syntax analyzing part 6). processing proceeds to a step 
S8 and the converting part 7. by controlling the similarity degree cateulating part 1 1 . causes word similarity degrees to 
be calcdated and on the basis of these word similarity degrees and other parameters computes translation suitabilities, 
which vwll be further discussed later, expressing the similarity (in meaning) between the input phrase and each of the 
exanple phrases stored in the noun phrase translation exanple memay 8. the verb phrase translation exarrple mem- 
ory 9 or the other translation exanple memory 10. The corrverting part 7 then detects the example phrase of which the 
translation suitability is the highest and processing proceeds to a step S9. 

In the step S9, the converting part 7 produces a translation (hereinafter for convenience called an input phrase 
translation) of the input phrase on the basis of the translatton corresponding to the example phrase detected in the step 
S8 and with reference to the translation dictionary 13, and outputs it to the phrase compounding part 14 together with 
the clause syntax information from the sinple syrrtax analyzing part 6. 

Here, the processing of the aforementioned steps S6 through S9 is carried out with respect to the exarrple phrases 
stored in either the noun phrase translation exanple memory 8, the verb phrase translation examp*e memory 9 or the 
other ti^nslation exarrple n>emory 10, depending on whether the input phrase is a noun phrase, a verb phrase or an 
other phrase respectively Therefore, because thp example phrases that are made objects of processing are limited 
according to the type of the input phrase, it is possible to increase the translation processing speed. 

When the phrase compounding part 14 has received translatwns (input phrase translations) of all the input phrases 
constituting a certain dause, in a step SlO it generates a clause of the second language by conpounding these input 
phrases on the basis of. clause syntax information also supplied from the converting part 7 and stpplies it to the clause 
compounding part 15. 
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After that, processing proceeds to a step Si 1 and the clause compounding part 15 determines whether or not the 
first language sentence is made up of a plurality of clauses. When in the step Si 1 it is determined that the first language 
sentence is made up of a plurality of clauses, that is. when a plurality of clauses (second language clauses) have been 
supplied to the clause compounding part 15 from the phrase compourxling part 14. or wnen clauses have been sup- 
plied to the clause compounding part 15 from both the clause dividing part 4 and the phrase compounding part 14, 
processing proceeds to a step Si 2 and the clause compounding part 15 generates a second language sentence by 
compounding these clauses and supplies it to the outputting part 16. and then processing proceeds to a step Sl3. 

When in the step Si 1 it is determined that the first language sentence is not made up of a plurality of clauses. That 
is, when only one clause has been supplied to the Ciduse compounding part 1 5 from the phrase compounding part 14. 
the clause compounding part 15 supplies this clause (a second language sentence) unchanged to the outputting part 
16 and processing skips the step Sl2 and proceeds to the step S13. In the step S13. the second language sentence 
supplied from the clause compounding part 15 Is outputted. and processing er»ds. 

Because, as described above, in the clause dividing part 4 and the simple syntax analyzing part 6 the first language 
sentence is divided In stages first Into clauses, which are large syntax units, and then into phrases, which are small syn- 
tax units, and translation of the syntax units obtained at each stage, i.e. clauses and phrases, is carried out using exam- 
ples. cotTTpared to a case wherein translation of the entire first language sentence is carried out in one stage, it is 
possible to Increase the translation processing speed. 

In the case described above, division of the first language sentence Into clauses is carried out when there Is among 
the clauses constituting the first language sentence a clause which matches a clause unit example stored in the dause 
pattern translation example memory 5 and on the basis of this clause; however, division into clauses can also be carried 
out irrespective of this kind of condition. 

Next, the processing carried out by the translating apparatus of Fig. 5 when for example 'DEKIMASHITARA 
ICHIGO NO AISU KUREEMU O ONEGAI SHITAI V DESU GA* ('If possible, may I have some strawberry ice cream?) 
is inputted will be desaibed with reference to Figs, 8 A through IOC. 

By morpheme analysis being carried out in the inputting part 1. for example the kind of morpheme information 
shown in Fig. 8A is extracted from this first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O 
ONEGAI SHITAI 'N DESU GA*. In Fig. 8A, for reference, modality case information in the grammatical attributes of the 
first language sentence is also shown. 

The first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA* 
and its morpheme information are supplied from the inputting part 1 to the collating part 2. Now. assuring that 'DEKI- 
MASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is not stored as an example in the set 
phrase translation example memory 3, the first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O 
ONEGAI SHITAI N DESU GA' and its morpheme information are supplied from the collating part 2 to the dause divid- 
ing part 4. 

In the clause dividing part 4. collation of the first language sentence 'DEKIMASHITARA ICHIGO NO AISU 
KUREEMU O ONEGAI SHITAI 'N DESU GA' with the dause unit examples stored in the dause pattern translation 
example memory 5 Is carried out. Here, supposing now that for example translation exanple data of the kind shown in 
Fig. 8B made up of an example ^DEKIMASHITARA X' and Its translation 'X. if possible' are stored in correspondence 
with each other in the dause pattern translation exannple memory 5. because in the first language sentence 'DEKIMAS- 
HITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' there is a dause matching the example 'DEKI- 
MASHITARA' (in this case. 'ICHIGO NO AISU KUREEMU 0 ONEGAI SHITAI 'N DESU GA' is recognized as matching 
the X part), the first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU 
GA' is divided up in the clause dividing part 4 into the clauses 'DEKIMASHITARA* and 'ICHIGO NO AISU KUREEMU 
O ONEGAI SHITAI 'N DESU GA'. 

The translation 'X. if possible' caresponding to the clause 'DEKIMASHITARA* matching a dause unit exarrple 
stored in the dause pattern translation example memory 5 is then supplied from the clause dividing part 4 to the dause 
compounding part 15. Also, the remaining clause of the first language sentence. 'ICHIGO NO AISU KUREEMU O 
ONEGAI SHITAI 'N DESU GA', is supplied from the dause dividing part 4 to the simple syntax analyzing part 6. 

In the simple syntax analyzing part 6. by the syntax of the clause 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 
'N DESU GA' being analyzed, it is recognized that 'ICHIGO NO AISU KUREEMU 0' constitutes one noun phrase and 
'ONEGAI SHITAI V DESU GA' constitutes one verb phrase. Also, in this case, it is recognized that the noun phrase 
'AISU KUREEMU O' is the Object of the verb phrase "ONEGAI SHITAI 'N DESU GA'. On the basis of these recognition 
results, the dause 'ICHIGO NO AISU KUREEMU O ONEGAI SHITAI 'N DESU OA' is divided into phrase units. 

Here, for example although the translation of the verb TORU' in the sentence OKANE O TORU' (1o take mone/) 
is 'take* or 'get', the translation of the verb TORU' in th9 sentence 'CHOUSHOKU O TORU* ('to have breakfast*) is "have' 
or 'eat'. Thus with a verb (or a verb phrase) it is sometimes difficult to determine its translation if the object, subject or 
complement is not known. For this reason, in the simple syntax analyzing part 6. in cases like this, when necessary an 
object, subject or complement or the like is induded with the verb (predicative part) to make one verb phrase. 

That is. in this case, the dause 'ICHIGO NO AISU KUREEMU 0 ONEGAI SHITAI 'N DESU GA' is divided into the 
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noun phrase ICHIGO NO AISU KUREEMU O' and the verb phrase, including an object, of 'AISU KUREEMU O 
ONEGA) SHITAI "N DESU GA*. 

Also, in the simple syntax analyzing part 6. grammatical attributes of the phrases are generated (recognized) and 
assigned to the phrases. 

That is. for the verb phrase ■ICHIGO NO AISU KUREEMU O*. for example slot-name values to the eHect that 
•ICHIGO NO- IS a modifier of 'AISU KUREEMU O' and 'AISU KUREEMU O" is modified by ICHIGO NO' are produced 
(however, because in this case these grammatical attributes are not used in subsequent processing they will not be 
referred to particularly in the following). 

Also, for the vertD phrase 'AISU KUREEMU O ONEGAI SHITAI 'N DESU GA'. as shown in Fig. 9A. a slot-name 
value (root ONEGAI SURU' + desiderative) meaning that 'ONEGA! SHITAI 'N DESU GA* is a desiderative form of the 
root ONEGAI SURU" and a slot-name value (NP1 AISU KUREEMU O*) meaning that Its object is ice cream (a food) 
are produced. 

The noun phrase 'ICHIGO NO AISU KUREEMU O* and the verb phrase 'AISU KUREEMU O ONEGAI SHITAI N 
DESU GA- and their grammatical attributes thus obtained are supplied from the simple syntax analyzing part 6 to the 
converting part 7. 

In the converting part 7. each of the phrases supplied from the simple syntax analyzing part 6 and the stored con- 
tent of the noun phrase translation example memory 8. the verb phrase translation exanple memory 9 or the other 
translation example memory 10 are collated and the example phrases rTX)st similar to the phrases supplied from the 
ample syntax analyzing part 6 are detected (searched) from the noun phrase translation example memory 8. the verb 
phrase translation example memory 9 or the other translation example memory 10. 

That is. when for instance in the verb phrase translation example memory 9 the example TANAKA SAN O ONEGAI 
SHITAI 'N DESU GA* ( Mr. Tanaka, please') is stored in association with the slot-name values (root 'ONEGA! SURU* + 
desiderative) and NP1 TANAKA SAN O' and also in correspondence with the translation of this, 'may I speak to NPI'?* 
as shown in Fig. 9B. and the example 'KOUHEE O IPPAI ONEGAI SHITAI 'N DESU GA' is stored in association with 
the slot-names (root 'ONEGA! SURU' + desiderative) and NP1 'KOUHEE O IPPAI* and also in correspondence with the 
translation of this, 'may t have NP1 ?'. as shown in Fig. 9C. then from a comparison of the distance in meaning between 
the objects 'AISU KUREEMU O* and TANAKA SAN O' and the distance in meaning between the objects 'AISU 
KUREEMU 0- and KOUHEE 0 IPPAP. the exanrple 'KOUHEE O IPPAI ONEGAI SHITAI N DESU GA' is detected as 
the exariple most similar to the verb phrase 'AISU KUREEMU O ONEGAI SHITAI DESU GA'. 

The translation 'may I have NPI r conresponding to this example XOUHEE O IPPAI ONEGAI SHITAI 'N DESU GA' 
is then read from the verb phrase translation example memory 9 and supplied from the converting part 7 to the phrase 
conpounding part 14 as the translation of the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N DESU GA'. 

Also, in the converting part 7. supposing that for instance the example 'SAKUJITSU NO GYUUNYUU* and its trans- 
lation yesterday's milk', the exarrple WATASHI NO HON' and its translation 'my book' and the example 'KOUHEE NO 
AISU KUREEMU' and its translation 'coffee flavored ice cream' are respectively stored in correspondence with each 
other in the noun phrase translation example memory 8. as shown in Fig. 10A. 'KOUHEE NO AISU KUREEMU* is 
detected as the exanple most similar to the noun phrase 'ICHIGO NO AISU KUREEMU 0' from the simple syntax ana- 
lyzing part 6. and the translation 'coffee flavored ice aeam* corresponding to this is read from the noun phrase transla- 
twn example menrx3ry 8 

In this case, in the converting part 7 it Is recognized that in the noun phrase 'ICHIGO NO AISU KUREEMU O' the 
•KOUHEE* in the example 'KOUHEE NO AISU KUREEMU' has become 'ICHIGO*. and ICHIGO*. which Is not in the 
example, is found up by referring to the translation dictionary 13. 

Here, it will be assumed that 'ICHIGO* and its translation 'strawberry* are stored in correspondence with each other 
In the translation dicttonary 13 as shown in Fig, 10B. 

When the converting part 7 finds 'ICHIGO' In the translation dictionary 13, it reads out the strawberry* stored in cor- 
respondence with it. Then, the translation 'coffee' of the 'KOUHEE', corresponding to 'ICHIGO*. In the translation 'coffee 
flavored Ice cream* read from the noun phrase translation example memory 8. Is replaced with the 'strawben-y' read 
from the translation dictionary 13, and the translation 'strawberry flavored k;e cream* of the noun phrase 'ICHIGO fslO 
AISU KUREEMU O* is thereby generated. This translation 'strawberry flavored ice cream' is supplied from the convert- 
ing part 7 to the phrase compounding part 1 4. 

In the phrase conpounding part 1 4. the translation 'strawberry flavored ice cream* of the noun phrase 'ICHIGO NO 
AISU KUREEMU O' and the translation 'may I have NPI?' of the verb phrase 'AISU KUREEMU O ONEGAI SHITAI 'N 
DESU GA* are compounded and the translation 'may I have strawberry flavored ice aeam? of the clause 'ICHIGO tslO 
AISU KUREEMU O ONEGAI SHITAI 'N DESU GA' is thereby generated, and this is supplied to the clause compound- 
ing part 15. 

When the clause compouncfing part 15 receives the translation 'may I have strawben-y flavored ice cream?' of the 
clause 'ICHIGO NO AISU KUREEMU 0 ONEGAI SHITAI 'N DESU GA* and the translation *X. if possible* of the dause 
'DEKIMASHITARA'. it compounds them. In this way. the second language sentence 'may I have strawberry flavored ice 
cream, if possible?', consisting of the first language sentence 'DEKIMASHITARA ICHIGO NO AISU KUREEMU O 
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ONEGAI SHITAI "N DESU GA' translated into the second language, is generated. 

Next, the method of detection of the example phrases most similar to the input phrases used in the converting part 
7 will be described in detail. Now, the words constituting an input phrase I will be expressed, from the beginning of the 
phrase, as iwi. iw^. • • • . iw^, and their slot-name values will be expressed as if,, if;.. - • • , ifQ. Similarly, the words 

5 constituting an example phrase E will be expressed, from the beginning of the phrase, as ew^. ew^. • • • . ew^. and 
their slot-name values will be expressed as efi. ef2. • • efp Here. M and N respectively are the numbers of words 
constituting the example phrase E and the input phrase I. and P and Q respectively are the numbers of grammatical 
attributes of (extracted from) the example phrase E and the input phrase I. 

The example most similar (similar in meaning) to an input phrase is the one most suitable to be used in translation 

10 processing of that input phrase. Now. if the probability of a certain example phrase E being suitable for translation of an 
input phrase I is called its translation suitability, this translation suitability can be expressed as a conditional probability 
of the example phrase as P(Example|lnput). Here. P() expresses probability and Example and Input respectively mean 
a set of example phrases and a set of input phrases. 

The example phrase most similar to an input phrase is that for which this translation suitability P(Example| Input) is 

fs the highest, and when this is written E^^^. this example phrase E^^, can be obtained from Exp. (2). 

E^„=maxE,Example[P(E|l)] (2) 

The right side of Exp. (2) means an operation obtaining the E for which the value inside the 0 'S a maximum. 
20 The translation suitability P(Example| Input) can be expressed according to Bayes law as shown in Exp. (3). 

P(Example|lnput)=P(Example)P(lnput|Example)/P(!nput) (3) 

Here, when finding the example phrase E (Ema^) most similar to a certain input phrase I. because there is only one 
25 input phrase I. that is. Input - I. the denominator P(lnput) of Exp. (3) is 1 and can be ignored. Therefore, from Exp. (2) 
and Exp. (3). it is possible to obtain the example phrase E^gx "lost similar to the input phrase I using the following Exp. 
(4). 

E ^=max Example(P(E)P(l|E)] (4) 

30 

From Exp. (4), to obtain the example phrase having the greatest translation suitability with respect to the input 
phrase, it is necessary to obtain the probabilities P(E) and P(I|E). 

Here, the probability P(E) represents the priori probability of the example E (Ee Example) being used for translation. 
The probability (conditional probability) P(I|E) represents the probability (hereinafter for convenience referred to as the 
35 transformation probability) of the example phrase E being transformed into the input phrase I and used. 

The priori probability P{E) must con-espond to the. suitability of the exanple phrase E being used in any input 
phrase, and therefore should for example reflect the following kinds of factor. That is. first it is necessary to reflect in the 
priori probability P(E) the factor of what kinds of expression are frequently used in the domain (the type of expression 
(writing) inputted into the translating apparatus (for example writing appearing in machine manuals, or newspaper arti- 
40 cles)) in which the translating apparatus is to be applied. 

Specifically, example phrases E corresponding to expressions frequently used in the donr^in must be given higher 
priori probabilities P(E) than exanrple phrases E corresponding to expressions only rarely used and expressions having 
meaning content having little relation to the domain. 

Next, it is necessary to reflect in the priori probability P(E) the factor of what other example phrases are stored in 
45 the translating apparatus besides that example phrase E (in the translating apparatus of Fig. 5. because the exanple 
phrases are divided into noun phrases, vert phrases and other phrases and respectively stored in the noun phrase 
translatbn example memory 8. the verb phrase translation example memory 9 and the other translation example mem- 
ory 10, depending on whether the input phrase is a noun phrase, a verb phrase or an other phrase this becomes the 
factor of what kinds of example phrases are stored in the noun phrase translation example memory 8. the verb phrase 
50 translation example memory 9 or the other translation example memory 10), Specifically, for example it is necessary to 
gh/e to extremely general example phrases E of which there are almost no other similar example phrases a higher priori 
probability P(E) than special example phrases and exanple phrases of which there are many similar example phrases. 

A priori probability P(E) in which these kinds of ^ctor are suitably reflected can for example be obtained using cor- 
pus data (training corpus data), which is a set of language expressions typically used in the domain in which the trans- 
55 lating apparatus is to be applied. 

That is. for example accurately selecting from among the example phrases stored in the translating apparatus 
those example phrases which are most suitable for translating corpus data is carried out manually or by other means 
for all the corpus data. In this case, if the number of example phrases stored in the translating apparatus is expressed 
Ne. the total number of times a selection is made from among these exanrple phrases is expressed T and the number 
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of times an example E, (i = i. 2. • • • . Ng) is selected is expressed freq (Ej). it is possible to obtain the priori probability 
P(Ei) for each example Ej for instance from Exp. (5). 

freq(E:) 

P(Ea.^ (5) 



Here, when selecting the example phrases most suitable for translation of corpus data manually is problematic, i» 
is possible to obtain approximate values of the priori probability P(Ej) in Exp. (5) by making the translating apparatus 
10 translate the corpus data. 

That is. as shown in the flow chart of Fig. 1 1 . first, in a step S21 , the priori probabilities P(E,) are all initialized to for 
example l/Ng. Then, in a step S22. using this priori probability P(Ei), the translating apparatus of Fig. 5 is made to trans- 
late all the corpus data, and at this time the number of times F(Ej) that each example phrase (E,) is used Is counted. 
Then, in a step S23. the priori probabilities P(Ei) are renewed for example according to the expression P(E j) = F{E . 
15 where L is the summation of F(Ej) over i = 1 to N^. 

After the priori probability P(Ej) has been renewed for alt the example phrases Ej. processing proceeds to a step 

524 artd detects the example phrase Ei(max) 9'ven the greatest priori probability. Processing then proceeds to a step 

525 and determines whether or not the priori probability P(Ej(max)) oi the example phrase E,|n,a,j is smaller than a pre- 
determined threshold value, and when it is not smaller returns to the step S22 and repeats the steps S22 through S25. 

20 When the priori probability (Ej^rna^)) 'S smaller than the predetermined threshold value, the priori probabilities (Ej) at that 
time are outputted as the priori probabilities for the example phrases Ej and processing ends. 

When obtaining corpus data is problematic, it is also possible for example to obtain priori probabilities P(Ej) in the 
following way using the example phrases stored in the translating apparatus. 

That is. if anx)ng the words constituting an example phrase Ej for which a priori probability P(E,) is to be obtained 

25 the number of wads that match words constituting another example phrase is expressed W^ and anrx)ng the slot- 
names of the example phrase (EJ the number of slot-names had by another exannple is expressed F^, the generality of 
that example phrase Ej, or its similarity sim-score (EJ to other examples, can for exannple be estimated as (WN+FN)/(the 
number of words constituting the example phrase Ej). In this case, the priori probability P(Ei) can for example be 
obtained according to Exp. (6). 



30 



P<^'>- sim.score(E/ N: ^ 

jLsim-score(Ep 



35 



Next, the transformation probability P(IIE) can be obtained in the following way. When a user expresses the mean- 
ing content of an example phrase E. it can be considered that with a certain high probability the user will use an expres- 
sion with words and phrases the same as the example phrase E. with a slightly lower probability the user will use an 

40 expression only slightly different from the example phrase E and with a lower probability the user will use an expression 
greatly different from the example phrase E. 

Therefore, the input phrase I can be regarded as a variation produced as a result of the expression of the exanple 
phrase E being distorted in the language production process of the user. In this connection, the following kinds of trans- 
forming operators for transforming (distorting) the example phrase E will t>e introduced. 

45 That is, as operators operating on the words constituting the example phrase E. a word copying operator echo-word 
(ewm,iwn), a word deleting operator delete-word(eWfT,), a word inserting operator add-word(iwn) and a wad replacing 
operator alter-wad(ewm,iwn) will be introduced. Also, as operatas operating on the slot-name values of the example 
phrase E, a slot-name value copying operator echo-feature(efp(fq), a slot-name value deleting operator delete-fea- 
tuce(efp), a slot-name value inserting operator add-feature(lfq) and a slot-name value replacing operator alter-fea- 

50 ture(efp,iy will be introduced. 

Here, the word copying operator echo-word(ew^,iWn) is an operator which without changing it makes the mth (nrrth 
from the beginning) word eWf^ of the exarrple phrase E the nth wad iw^ of the input phrase I. The word deleting oper- 
ata detete-word(eWrT,) is an operata which deletes the mth word ew^ of the example phrase E. The word irtserttng 
operator is an operator which Inserts (adds) the nth word iw„ of the input phrase I to the example ph'qse E. The word 

55 replacing operator alter-word(ew„,iwn) is an operata which replaces the mth word ew^ of the example phrase E with 
the nth word iw^ of the input phrase I (here, the word iw,, is dose in meaning to but different from the word ew^. 

The slot-name value copying operata echo-feature(efp.ifq) is an operator which without changing it makes the ptii 
slot-name value of the example phrase E (in the case of slot-name values, their position in the phrase does not partic- 
ularly affect processing and therefae here the order of the slot-name values is a matter of convenience) the qth slot- 



14 



EP 0 805 403 A2 



name value if^ of the input phrase I. The slot-name value deleting operator delete-feature(efp) is an operator which 
deletes the pth slot-name value efp of the example phrase E. The slot-name value inserting operator add-feature(ifq) is 
an operator which inserts the qth slot-name value if^ of the input phrase I into the example phrase E. The slot-name 
value replacing operator alter-feature(efp.ifq) is an operator which replaces the pth slot-name vRiue of the example 
phrase E with the qth slot-name value ifq of the irput phrase I (here, the slot-name value ifq is close in meaning to but 
different from the slot-name value efp). 

Now. if the aforementioned transformng operators as a whole are expressed distort^ and that which is obtained by 
Z transforming operators distort,. distort2. • • • . distort^ being applied to the example phrase E the input phrase I. 
the input phrase I can be defined as the set of these transforming operators, as shown in Exp. (7). 

l={distort pdistortj. • • • .distort^) (7) 

There may be more than one way of transforming the example phrase E into the input phrase I. but here it is suffi- 
cient to consider just a transformation following a most probable direct route. 

From Exp, (7). the transformation probability P(I|E) can be expressed as shown in Exp. (8). 

P(l|E)=P({distort,.distort2, • - • .distort,}|E) (8) 

Here, as shown in Exp. (9), the transforming operators can each be regarded as independent. 

P(distortjdistort,.distort2. • - • • .distort 2:E)-P{distort J E) (9) 

Therefore, from Exp. (9). Exp. (8) can be simplified as shown in Exp. (10) and as a result the transformation prob- 
ability P(I|E) becomes as shown in Exp. (11). 

2 

P({distorti. distort 2. .distort 2)|E)«J-JP(distort J E) (io) 

Z 

pi»iE)-np(^'s^^2iE) (11) 

z-1 



Also, because the example phrase E can be expressed as the set of the words ew^ (m = l. 2. • • • . M) arxJ the 
slot-name values efp (p = 1, 2. • • • . P). Exp. (11) becomes as shown in Exp. (12). 

z 

P(l|E)-fIP(distortj|ew,.ew2.' • • • .ew,^^:ef ^.efg. • • • - .efp) (12) 
z-1 



From Exp. (12), if the probability P(distortkl ew^.ewg. • • •.ew^; ef,.ef2, • • • ,efp) is known for the cases wherein 
the transforming operator distortk is respectively the word copying operator echo-word(ewm,iWn). the word deleting 
operator delete-word(eWr„), the word inserting operator add-wortJ(iWn). the word replacing operator alter- 
word(ewr„.iWn), the slot-name value copying operator echo-f6ature(efp,ifq). the slot-name value deleting operator 
delate-fQature(efp). the slot-name value inserting operator add-foature(ffq) or the slot-name value replacing operator 
atter-feature(efp>ifp), the transtormation probability P(I|E) can be obtained. 

Here, the following assumptions can be made about the transforming operators, and as a result it is possible to sim- 
plify the probability P(distortk| ew^.ewg, • • • ,ew,^; ef^.efg. • • • .efp) for each of the transforming operators. 

That is. in the case of the word copying operator €Cho-word(eWf„,iWn), the probability of the mth word ew^ of the 
example phrase E being made the nth word iw,, of the input phrase I without being changed can be assumed to depend 
only on these words ew^,. iw^. Therefore, the probability P(echo-word(eWni.'Wn)| €Wi.ew2. • • • .ew^; eft.efg, • • '.efp) 
can be simplified as she* .i in Exp. (13). 

P(echo-word(ew„.lw„)|ewT.ew2.* • .ew^ief i.efg. • '.efp))- P(echo-word(ew^,iWn)) (13) 

Because the probability shown in Exp. (13) is the probability of the word ew^ being as it were copied as a word of the 
input phrase I without being changed, hereinafter it will for convenience be referred to as the word copy probability. 
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In the case of the word deleting operator delete-word(ew^). it can be assumed that the probability of the mth word 
ew„, of the example phrase E being deleted depends only on that word ew,„. Therefore, the probability P(delete- 
word(ew Jl ew,.ew2. • • ■ .ew^; efi.efs. • • • .efp) can be simplified as shown in Exp. (14). 

P(delete-word{ew^)|ewi.ew2. • • .ew„:ef i.ef • • .ef p))-P(delete-word(ew„)) (i4) 

Because the probability shown in Exp. (14) is the probability of the word ew^ being deleted, hereinafter it will for con- 
venience be referred to as ti.e word deletion probability. 

In the case of the word inserting operator add-word(iw„). it can be assumed that the probability of the nth word iw^ 
of the input phrase I being inserted into the example phrase E depends only on that word iw„,. Therefore, the probability 
P(add-word(iw„)| ew,.ew2. • • • .ew^: efi.efg. • • • .efp) can be simplified as shown in Exp. (15). 

P(add-word(iw^)|ew ,.ew2. • • .ew„;ef ,.ef • • .ef p))-P(add-word(iw J) (15) 

Because the probability shown in Exp. (15) is the probabilrty of the word iw„ being inserted (added), hereinafter it will 

for convergence be referred to as the word insertion probability (or the word addition probability). 

In the case of the word replacing operator alter-word(ew^.iwn). it can be assumed that the probability of the mth 

word ewp, of the example phrase E being replaced with the nth word iw^ of the input phrase I depends only on those 

words ew^ and iw^. Therefore, the probability P(alter-word(ew^.iWn)|ewi.ew2.- • -.eww; efi.ef2. • • -.efp) can be sim- 
plified as shown in Exp. (16). 

P(alter-word(ew^.iw„)|ew T.ewg, • • .ewj^:ef ^.ef • • .ef p))«P(alter-word(ew^.iw„)) (16) 

Because the probability shown in Exp. (1 6) is the probability of the word ew^, being replaced with the word iw^. herein- 
after it will for convenience be referred to as the word replacement probability. 

Also In the cases of the slot-name value copying operator echo-feature(efp.rfq). the slot-name value deleting oper- 
ator delete-feature(efp), the slot-name value inserting operator add-feature(ifq) and the slot-name value replacing oper- 
ator alter-feature(efp.rfq) expressing copying, deleting, inserting and repladng of slot-name values, the same 
assumptions can be made as when the words in the word copying operator echo-word(ewm.iwn). the wad deleting 
operator delete-word(ew^). the word inserting operator add-wordOw^) and the wad replacing operator alter- 
word(ewn^.iwn) expressing copying, deleting, inserting and replacing of words are replaced with slot-rame values. 

As a result, the probability of the slot-name value efp being copied (hereinafter for convenience referred to as the 
slot-name value copy probability) P(echo-feature(efp.ifq)| ew,,ew2, • • - .cwm; ef^.efg. • • - .efp). the probability of the 
slot-name value efp being deleted (hereinafter fa convenience referred to as the slot-name value deletion probability) 
P(delete-feature(efp)| ew^.ewg, • • • .ew^: efi.ef2. • • - .efp). the probability of the sbt-name value ifq being inserted 
(hereinafter fa convenience referred to as the slot-name value insertion probability) P(add-feature(ifq)| 

ewi,ew2. .cwm: efi.ef2. • • • .efp) and the probability of the slot-name value efp being replaced with the slot-name 

value ifq (hereinafter for convenience referred to as the slot-name value replacement probability) P(alter-feature(efp.ifq)| 
ewi.ewg. • • • .ew^: efi,ef2. • • • .efp) can respectively be simplified as shown in Exps. (17) through (20). 



P(echo-feature(efp.ifq)|ew ^.ewj. • • .ew^ief ,.ef • • .efp))- P(echo-feature(efp.if q)) (17) 

P(ddete-feature(efp)|ew ^.ewg. • • ,ew^^;ef i,ef2. • • .ef p))«P(delete-feature(efp)) (18) 

P(add-feature(ifq)|ew ^.ewj. • • .ew„;ef ^.efj. • • .ef p))-P(add-feature(ifq)) (19) 

P(alter-featuro(efp.ifq)|ew i.ewj. • • .ow^^;ef ^.efj. • • .ef p))-P(alter-feature(efp.if q)) (20) 



From the above, if the word copying operator echo-word(ewm jw^), the wad deleting operator delete-word(ew^). 
the word inserting operator add-word(iWn) and the word replacing operator alter-word(ew^,iWn), which are transfaming 
operators fa words (and hereinafter for convenience referred to as word transforming operators), are expressed as a 
whole as distort-word,(ewn,.iWn) and the stot-nanr>e value copying operator echo-feature(efp.ifq), the slot-name value 
deleting operata delete-feature(efp), the slot-name value inserting operata add-feature(ifq) and the slot-name value 
repladng operator alter-feature(efp.ift^. which are transfaming operators fa slot-name values (and hereinafter for con- 
venience referred to as slot-nanr>e value transfaming operatas) are expressed as a whole as distort-featurey (efp.ifq). 
the transformation probability P(I|E) expressed in Exp. (11) can be expressed with the product of the transformation 
probabilities (hereinafter for convenience referred to as the word transformation probability) P(distort-wordj,(ewm,iwn)) 
of ttie wad transforming operator distort-wordx(ewm,iwn), and the product of the transformation probabilities (hereinaf- 
ter for convenience referred to as the slot-name value transformation probabilities) P(distort-featurey(efp.ifq)) of the slot- 
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name value transforming operator distort-featurey(efp.ifq). 




(21) 



Here. X and Y respectively are the total numbers of word transforming operators and slot-name value transforming 
operators applied, and X+Y = 7 (Exp. 11)). 

Next, the methods by which the word cony probability P(echo-word(ew^.iWri)). the word deletion probability 
P(delete-word(ew^)). the word insertion probability P(add-word(iWn)). the word replacement probability P(alter- 
word(ew^.iwj). and the slot-name value copy probability P(echo-feature(efp.ifq)). the slot-name value deletion proba- 
bility P(delete-feature(efp)). the slot-name value insertion probability P{add-feature(ifq)) and the slot-name value 
replacement probability P(alter-feature(efp.if(,)) are calculated will be described. 

First, the word cqpy probability P(echo-wond(eWm.tWn)). the word deletion probability P(delete-word(ew^)) and the 
word insertion probability P(add-word(iw„)) can be regarded as reflecting human language production behavior. That 
is. some types of word have a greater or lesser probability of being copied, deleted or inserted than other words. There- 
fore, when corpus data of the domain in which the translating apparatus is to be applied is available, it is possible to 
compute distributions of the probability of words being copied, deleted or replaced from corpus data and thereby obtain 
the word copy probability P(echo-word(ew^.iWn)). the word deletion probability P(deiete-word{ewm)) and the word 
insertion probability P(add-word(iWn)). 

When on the other hand corpus data is difficult to obtain, on the basis of tendencies of words in language produc- 
tion (trends in the treatment of words (for example trends such as words tending to be used unchanged or tending to 
be deleted or inserted)), it is possible to classify words into a number of groups and give a word copyirig probability 
P(echo-word(ewm.iWn)). a word deletion probability P(delete-word(ewrn)) and a word insertion probability P(add- 
word(iwn)) to each of these groups. 

In this preferred embodiment, wor6s are classified into the four groups of strong independent words (for example 
verbs and nouns and so on); light independent words (for example adjectives and some adverts (for exarrple "MATA' 
('again') and the like); functional words (for example partides and conjunctions); and modifiers (for example adverbs 
expressing level (tor exarrple TOTEMO* (Very^ (also including adjuncts), and constants of the kind shown in Exps. (22) 
through (24) are given as the word copy probability P(echo-word{ew^.iWn)). the word deletion probability P (delete- 
*«rd(ewr„)) and the word insertion probability P(add-word(iWn)-) respectively depending on which group the word to 
which the word transforming operator is to tje applied belongs to. 



Here, echo-strong-content, echo-light-content, echo-grammatical-function and echo-modifier are predetem^ned con- 
stants between 0 and 1. The word copy probability P{echo-word(ewn„iWn)) is given by Exp. (22) when the words ewn, 
and iwn, are the same; when they are not the san>e. it is made 0 (this is also the same in the case of the stot-name value 
copy probability, which corresponds to the word copy probability). 



Here, delete-strong-content, delete-light-content, delete-grammatical-functlon and delete-modrfier are predetermined 
constants between 0 and 1 . 



P(echo-word(ew^,iw„)) r echo-strong-content - when ew ^ 
is a sti'ong independent word echo-light-content - when ew^ 
is a light independent word echo-grammatical-function - when ew ^ 
is a functional word echo-modifier - when ew ^ is a modifier or an adjunct 



(22) 



P(delete-word{eWrn)) ^ delete-strong-contenl - when ew^ 
is a strong Independent word delete-light-oontent - when ew ^ 
is a light independent word delete-grammalteal-function - when ew ^ 
is a functior«l word delete-modrfier - when ew ^ is a modifier or an adjunct 



(23) 
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P(add-word(iw„)) = add-strong-cx)ntent • when rw„ 
is a strong independent word add-light-content - when iw „ 
is a iight independent word add -grammatical -function - when iw „ 
is a functional word add-modifier - when iw„ is a modifier or an adjunct 



Here, add-strong-content, add-light-content. add -grammatical-function and add-rrxxJifier are predetermined constants 
between 0 and i . 

In this preferred embodiment, in the same way, the slot-name value transformation probabilities, namely the slot- 
name value copy probability P(echo-feature(efp.ifq)). the slot-name value deletion probability P(delete.feature(efn)) the 
siot-name value insertion probability P(add-feature(ifq)) and the slot-name value replacement probability P(alter-fea- 
ture(efp.ifq)). are also given as constants. 

Next, concerning the word replacement probability P(alter-word{ew^.iw J), this can be computed on the basis of 
the closeness in meaning between the words ew^ and iw„. That is. if the word iw„ and the woid ew^ are synonyms a 
high probability can be given as the word replacement probability P(alter.word(ew^.iwJ). if the word iw„ and the word 
ew^ are similar in meaning a slightly lower probability, and when the word iw^ and the word ew^^ are unrelated then a 
lower probability can be given to the wortJ replacement probability P(alter-word(ew^.iWn)). 

This kind of probability distribution relating to the words ew^ and iwn can be obtained using corpus data. That is. 
using context similarity, when two words are used in similar contexts in the corpus data, the similarity between these 
two words can be regarded as being high, and when they are used in different contexts the similarity between two words 
can be regarded as being low. The two words having a high similarity can be given a high word replacement prob^ility. 

When on the other hand corpus data is difficult to obtain, rt is possible to conpute word replacement probabilities 
for exanrple using a thesaurus of the kind shown in Fig. 6. 

Here, in the thesaurus, distances in meaning l^etween two words are reflected as the amount of information 
included by the concept (parent node) common to the two words. That is. when the common concept is an extremely 
general one like for exanple Ihing*. it is not possible to obtain from that concept Information relating to a specific word 
that it includes, but when the common concept is a considerably limited concept like for example Vehicle with engine', 
it is possible to obtain from the concept much information relating to specific words that it includes. In other words, the 
amount of information included by a given concept (a node in the thesaurus) is inversely proportional to the nuriDer of 
words included in that concept (the number-dominated). 

Therefore, the word replacement probability P(alter-word(ew^,fWn)) can be obtained from the amount of infomia- 
tion included in the concept common to both the word ew^ and the word iw^. that is. on the basis of the number of words 
included in that concept in the thesaurus, for example according to Exp. (25). 

''^""^•'^^^"'•^"'■ Kxnumber.JominIted(C(ew,.iw„)) > (25) 



In the above expression. K is defined as follows: 



^~ ? ^number-dominated(C(ew_. iw„)) ^^^^ 

Here, C(ew„^.iwn) is a concept common to the words ew^ and iw^, and number-dominated(C(ewn„iWn)) is the number 
of words included in the corKept C(ew^,iWn) (words having the concept CKew^.iWn) as a higher concept). Vocab means 
the set of wads in the thesaurus. Therefore, the first and second L on the right side of Exp. (26) are respectively the 
summations of the words ew^ and iw^ successively substituted for the words in the thesaurus. 

If a method like this wherein the distance between two words ew^ and iw^ is obtained on the basis of the amount 
of information included in a concept common to them is used, it is possWe to accurately obtain the distance (relative 
distance) between the words Irrespective of the structure of the thesaurus. That is. the thesaurus does not have to be 
constructed regularly, as it has been in the related Rrt, and also the distances between concepts do not have to be the 
same. Also, the method can be applied both to thc.:auruses wherein words are classified minutely and to thesauruses 
wherein words are classified roughly. It can also be applied to tfiesauruses wrfierein classification has been carried out 
minutely or roughly only in a part of the thesaurus. Furthermore, the thesaurus may be one dedicated to a certain spe- 
cific domain or may be a general one. 

The transformation probability P(I|E). as was shown above in Exp. (21), can be obtained as the product of the word 
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transformation probabilities P(distort-wordx(ew^.iWn)) and the product of the slot-name value transformation probabili- 
ties P(distort-wordy(efp.if^^)). Of the product of the word transformation probabilities P(distort-wordx(ew,nJwn)) and the 
product of the slot-name value transformation probabilities P(distort-wordy(efp.ifp)). the product of the slot-name value 
transformation probabilities P(distort-wordy(efp.ifq)) can be obtained relatively simply as tt-e product of ihe slot-name 
value copy probabilities P(echo-feature(efp.ifq)). the slot-name value deletion probabilities P(delete-feature(efp)). the 
slot-name value insertion probabilities P(add-feature(ifq)) and the slot-name value replacement probabilities P(alter- 
feature(efp.ifq)). 

That is. concerning grammatical attributes, the grammatical attributes of the input phrase (hereinafter for conven- 
ience referred to as the input attributes) and the grammatical attributes of the example phrase (hereinaner for conven- 
ience referred to as the example attributes) are compared, and exanple attributes the same as input attributes, 
example attributes not anrong the input attributes, input attributes not among the example attributes, and example 
attributes which can be replaced with input attributes are detected. When there is an example attribute the same as an 
input attribute, an example attribute not among the Input attributes, an input attribute not among the exarrple attritxJtes. 
or an example attribute which can be replaced with an input attribute, the slot -name value copy probability P(echo-fea- 
ture(efp.ifq)). the slot-name value deletion probability P(delete-feature(efp)), the slot-name value insertion probability 
P(add-feature(ifq)) or the slot-name value replacement probability P(alter-feature(efp.ifq)) can be obtained (however, in 
this preferred embodiment, because as mentioned above these are all given as constants, it is not necessary to com- 
pute them), and by calculating their product it is possible to obtain the product of the slot-name value transformation 
probabilities P(distort-wordy(efp.ifq)). 

Specifically, when for example the input phrase is 'YOYAKU O SHITAI NO DESU GA* and its slot-name values are 
(desiderative +) (extended-predicate +) (s-part GA) (formal +) and the example phrase is "YOYAKU 0 SHITE ITADAKI- 
TAl NO DESU GA' and its slot-name values are (gerund-suffix ITADAKU) (desiderative +) (extended-predicate +) (s- 
part GA) (formal +), the slot-name values (desiderative +). (extended isredicate -i-). (s-part GA) and (formal +) are 
detected as example attributes the same as input attributes and the slot-name value (gerund-suffix ITADAKU) is 
detected as an example attribute not among the input phrase attributes. 

In this case, by calculating the product of the slot-name value copy probabilities P(echo-feature(efp,ifq)) of the slot- 
name values (desiderative +). (extended-predicate +), (s-part GA) and (formal +) and the slot-name value deletion prob- 
ability P(delete-feature(efp)) of the slot-name value (gerund-suffix ITADAKU) it is possible to obtain the product of the 
slot-name value transformation probabilities P(distort-wordy(efp.ifq)). 

In obtaining the product of the word transtornnation probabilities P(distort-word,(eWrT,.iWn)), on the other hand. It is 
necessary to find the as it were most direct transfornfiation route from the example phrase E to the input phrase I from 
anrx)ng a number of transformation routes. 

That is. when for exanple 'KOKO Nl 3JI Nl DENWA SURU* as an exanple phrase E has been assigned the trans- 
lation 'call here at 3 o'clock*, and this exarrple phrase E is used in translation of the input phrase 'RESUTORAN Nl 
DENWA SURU'. for example the following two transformation routes (transformation methods) are conceivable. 

firstly, if with respect to the exanple phrase 'KOKO Nl 3JI Nl DENWA SURU'. from the first word. 'KOKO* is deleted. 
N r is deleted. 'SJI' is replaced with 'RESUTORAN'. 'Nl' is copied. 'DENWA' is copied. 'O' is inserted and 'SURU' is cop- 
ied, the input phrase 'RESUTORAN Nl DENWA SURU* Is obtained. Therefore, in this case, in the translation 'call here 
at 3 o'clock', 'here* caresponding to "KOKO Nl' is deleted. '3 o'clock* corresponding to 'SJI* and 'the restaurant' corre- 
sponding to 'RESUTORAN* are replaced, and the translation 'call at the restaurant' is obtained. 

Secondly, if with respect to the exanple phrase 'KOKO Nl 3JI Nl DENWA SURU'. from the first word. 'KOKO' is 
replaced with 'RESUTORAN'. 'Nl' is copied. '3Jr is de(eted. 'Nl* is deleted, 'DENWA* is copied. 'O' is inserted and 
'SURU' is copied, then again the input phrase 'RESUTORAN Nl DENWA SURU* is obtained. In this case, in the trans- 
lation 'call here at 3 o'clock*, 'here' corresponding to 'KOKO Nl' is replaced with the restaurant* corresponding to 'RESU- 
TORAN Nl' and 'at 3 ododK corresponding to "SJI Nl' is deleted, and the translation call the restaurant' is obtained. 

In this case, the translation 'call the restaurant', obtained with the second transformation route, is correct and there- 
fore, in obtaining the word Iranstormation probability P(di6tort-wordx(ewm,iwn)), rt Is necessary to find the as it were 
most direct transformation route (hereinafter for convenience referred to as the optimum route) like this sdcond trans- 
forn^tion route. 

To do this, even when the lengths of the example phrase E and the input phrase I are different, i.e. even when the 
example phrase E and the input phrase I consist of different nunnt>ers of words, it is necessary to recognize correspond- 
ing words and find the best correspondence relationship (alignment) between the example phrase E and the input 
phrase I and then find the set of most probably-correct transforming operators (word transforming operators) that 
should be applied to transformation of the example phrase E into the input phrase I. 

Now. if the set of word transforming operators is .vritten Distort (=(distort-wordi(ew^.iWn). distort-word2(eWn»,iWn). 
• • • . distort-wordj,(ew^.iWn))), the set Distort^ of most probably-correct word transforming operators (hereinafter for 
convenience referred to as the optimum word transforming operators) to be applied to transformation of the example 
phrase E into the input phrase I can be obtained using Exp. (27). 
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Distort^=maxc5jston(P(C>'Stort|E.I)] (27) 

The right side of Exp. (27) means an operation for obtaining the Distort for which the value inside the Q is the maximum. 

Here, the set of optimum word transforming operators is obtained for example using DP (Dynamic Programming). 
That is. first, to obtain from the probability P(Distort|E.I) the distance in DP. its negative log of -log(P(D(Ston|E,l)) is con- 
sidered. Now. writing D = -log(P( Distort] E.I)) . by assuming that the operators constituting the word transibrming oper- 
ator set Distort are independent, D can be simplrfied as shown in Exp. (28). 

x 

D=-log PI P{distort-word ^(ew ^ jw J|E.I) (28) 

x-1 



If it is assumed that the word transforming operators are independent from those words, among the words of the 
input phrase I. which they do not affect directly. Exp. (28) can be further simplified as shown in Exp. (29). 

X 

D=-log Y] P(distort-word ^(ew ^,iw „)|ew ^.iw „) (29) 



Dividing Exp. (29) for the individual transforming operators yields Exp. (30). 

X 

D= ^ -log P(distort-wond ,(ew ^,iw „)) (30) 

x«1 



The -log(P(distort-wordx(ewm.iWn))) on the right hand side of Exp. (30) is the distance between the words ewn, and 
iwn when the nrrth word ew^ of he exannple phrase and the nth word iw^ of the input phrase are made to correspond, 
and this can be seen as directly corresponding to individual penaftles (weight on the path) (cost) used in dynamic pro- 
gramming equations. Therefore, if distort-wordx(ew„.iWp) is rewritten as the wad copying operator echo- 
word(ewn^.iwn). the word deleting operator delete-word(ew^). the word inserting operator add-word(iw„) and the word 
replacing operator alter- word(eWj^,iwn) and the sum of the distances (the cumulative distance) of when as far as the nrth 
word ew„ of the example phrase and the nth word iw^ of the input phrase are made to correspond is expressed G(m.n). 
this cumulative distance G(m,n) can be obtained regressiveiy according to the gradualized expression shown in Exp. 
(31). 



G(fn, n)=min 



'G(m— 1, n-1) - logP(echo— word(ewm.iwn)) 
G(m, n— 1 ) - logP(add-word(iwn)) 
G(m-1 , n) - logP(delete-word(eWfn)) 

-G(m-1 , n-1) - logP{alt€r-word(ewtn, iwn) 



(31) 



For the initial values, suitable values are assigned. 

By calculating Exp. (31) for until for example m=M, it is possible to obtain the distance (the shortest distance) 
between the input phrase I and the example phrase E, and the path (DP path) obtained at that time shows the best (opti- 
mum) alignment t>etween the input phrase I and the example phrase E, i.e. the optinnum path. The set of word trans- 
forming operators which transform the example phrase E to match the input phrase I according to this optimum path 
(alignment) are the optinnum word transforming operators. 

By adopting the approach described above, it is possible to obtain the set of optimum word transforming operators 
even when the lertgths of the example phrase E and the input phrase I are different 

Because the probability P(Distort|EJ) is the probability of a set of word transforming operators Distort being used 
to collate an input phrase I with an exanrple phrase E and eliminate the differences in expression between the two. the 
probability model based on this probability can be called a stochastic expression collating model, and because the dis- 
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tance between the input phrase I and the example phrase E is obtained on the basis of this stochastic expression col- 
lating nnodel it can be called a stochastic-expression-collating-nxxjel-based distance. 

After the set of optimum word transforming operators is obtained in the way described above it is possible to obtain 
the product of the word transfornnation probabilities P{distort-wordx(ew^jwn)). and it is then possible to obtain the trans- 
formation probability P{I|E) from this and the product of the slot-name value transformation probabilities P{distort- 
wordy(efp.ifq)) according to Exp. (21). By calculating the product of this transformation probability P(I|E) and the priori 
probability P(E) it is possible to obtain the translation suitability (P(I|E)P(E)), and the example phrase E of which this 
translation suitability is the greatest is the example phrase most similar to the input phrase I. 

In the converting part 7, the example phrase E most similar to the input phrase I is detected in the way described 
above. That is. using a priori probability nrxxiel and a transformation probability model respectively based on the priori 
probability P(E) and the transformaHon probability P(I|E). the example phrase E most similar to the input phrase I is 
detected. In this case, by using the transformation probability model, without excessively adding example phrases hav- 
ing superficial variations and omissions of particles and nouns and hesitations and so on in so-called spoken language 
to example phrases already prepared, it is possible to detect the exan^ale phrase most suitable for translating an input 
phrase, and as a result it is possible to increase the quality of translations. 

Next, with reference to the flow chart of Fig. 12. the processing in the converting part 7 by which the example 
phrase E most similar to the input phrase I is detected will be described further. It will be assumed that priori probabili- 
ties P(E) have been assigned to the example phrases E and are stored in the noun phrase translation example memory 
8. the verb phrase translation example memory 9 and the other translation example memory 10. It will be assumed also 
that the word copy probability P{echo-word(ew^.iWn)). the word deletion probability P(de!ete-word(ew J) and the word 
insertion probabilHy P(add-word{iwJ). and the slot-name value copy probability P(echo-feature(efp.ifq)). the slot-name 
value deletion probability P(delete-feature(efp)). the slot-name value insertion probability P(add-feature(if<,)) and the 
slot-name value replacement probability P(alter-feature(efp.ifq)) are given as constants and stored in a memory inside 
the converting part 7. Also, here, as an input phrase, it will be assumed that for example a noun phrase has been input- 
ted into the converting part 7 and that therefore processing will be carried out using only the example phrases stored in 
the noun phrase translation example memory 8. However, if a verb phrase or an other phrase is inputted into the con- 
verting part 7 as the input phrase, the only difference is that the example phrases used in the processing become those 
stored in either the verb phrase translation example nnemory 9 or the other translation example memory 10. and the 
processing method is the same as in the case of a noun phrase described in the following. 

When the converting part 7 receives an input phrase I (hero, as mentioned above, a noun phrase), first, in a step 
S31 . it reads an example phrase (a noun phrase) from the noun phrase translation exanple memory 8 (hereinafter, this 
read-out exarrple phrase E will be called the focus example phrase). Also, in a step S32, the converting part 7 calcu- 
lates the transformation probability P(I|E) of the focus example phrase E with respect to the input phrase 1, and then 
proceeds to a step S33. In the step S33, in the converting part 7. the probability P(E) assigned to the focus example 
phrase E is read from the noun phrase translation exaople memory 8. and this and the transformation probability P(I|E) 
calculated in the step S32 are multiplied together. In this way. the translation suitability of the focus example phrase E 
with respect to the input phrase I is obtained. 

After the translation suitability is obtained, proceeding to a step S34. the converting part 7 determines whether or 
not calculation of a translation suitability has been carried out for all the example phrases stored in the noun phrase 
translation example memory 8. When in the step S34 it is determined that calculation of a translation suitability has not 
been carried out for all the exanrple phrases stored in the noun phrase translation example memory 8. processing 
returns to the step S31. an example phrase E for which a translation suitability has not yet been calculated is read out 
from the noun phrase translation example memory 8, and the processing of the step S32 and the subsequent steps is 
repeated with this as the focus example phrase E. 

When on the other hand it is determined in the step S34 that calculation of a translation suitability has been carried 
out on all the example phrases stored in the noun phrase translation example memory 8. processing proceeds to a step 
S35, the example given the greatest translation suitability with respect to the input phrase I is detected in the converting 
part 7, and then processing ends. 

Next the transformation probability calculation processing of the step S32 of Fig. 12 will be described with refer- 
ence to the flow charts of Fig. 13 and Fig. 14. 

First. Fig. 13 is a flow chart illustrating the slot-name value transformation probability calculation processing. The 
converting part 7 first in a step S41 conpares the input phrase I to the focus example phrase E and obtains slot-name 
value copy probabilities, slot-name value deletion probabilities, slot-name value insertion probabilities and slot-name 
value replacement probabilities. Then, in a step S42, the converting part 7 obtains a slot-name value transformation 
probability by multiplying these together, whereupon processing ends. 

Rg. 14 is a flow chart illustrating the word transformation probability calculation processing. The converting part 7 
first in a step S51 obtains the cumulative distance G(m.n) between the input phrase I and the focus example phrase E 
according to Exp. (31) and then proceeds to a step S52 and determines whether or not m is less than M (the nuntier 
of words constituting the focus example phrase E). When in the step S52 it is determined that m is less than M, process- 
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ing returns to the step S51 and the converting part 7 increases m and calculates Exp. (31) while changing n. Then, in 
the step S52. when it is determined that m is not less than M, i.e. when m is equal to M. the path of when the cumulative 
distance G(m.n) was obtained, i.e. the optimum path, is detected and processing proceeds to a step S53. 

In the step S53. in the converting part 7. the word transformation probability is calculated following the obtained 
optimum path. That is. word copy probabilities, word deletion probabilities, word insertion probabilities and word 
replacement probabilities are obtained and by these being multiplied together the word transformation probability is 
obtained. With respect to the word replacement probabilities, the converting part 7 controls the similarity degree calcu 
lating part 1 1 to calculate it according to Exp. (25). That is. the similarity degree calculating part 1 ^ obtains the number 
of words included in a concept in the thesaurus stored in the thesaurus memory 12 common to a word of the input 
phrase I and a word constituting the focus example phrase E and on the basis of that number of wonds obtains a word 
similarity degree expressing the similarity in meaning between the two words, i.e. the word replacement probability 
obtained by calculating Exp. (25). 

Next. Fig. 1 5 shows the construction of another preferred embodiment of a translating apparatus according to the 
invention. Parts in Fig. 15 corresponding to parts shown in Fig. 5 have been given the same reference numerals and 
will not be described in the following. That is. except in that the converting part 7 has a cacfie 7A. this translating appa- 
ratus is basically constructed in the same way as that shown in Fig. 5. 

The cache 7A temporarily stores provisionally deleted words and provisionally added words. The cache 7A is called 
a 'cache' only because it stores information temporarily like a cache memory disposed between a CPU and main mem- 
ory, and it does not necessarily have to be a high-speed memory like an ordinary cache memory. 

In the preferred embodiment described above, as transforming operators operating on words constituting an exam- 
ple phrase E, the word copying operator echo-word(ew^.iwJ. the wad deleting operator deIete-word(ew J, the word 
inserting operator add-word(iwn) and the word replacing operator alter- word(ewm.iwn) were used, but with these four 
operators it is difficult to deal with cases where the example phrase E is transformed into the input phrase I just by the 
positions of the words of the example phrase E being changed (for example when the example phrase E is 'WATASHl 
WA MIKAN O TABETAI* ( I want to eat an orange') and is to be transformed into the input phrase I MIKAN O WATASHl 
WA TABETAI') ( an orange • I want to eat ) and cases where the example phrase E is to be transformed into the input 
phrase I by words of the example phrase E being be replaced with words of the input phrase I and also having their posi- 
tions changed (for example when the example phrase E is 'WATASHl WA MIKAN O TABETAI' ( I want to eat an orange*) 
and is to be transformed into the Input phrase I 'RINGO O WATASHl WA TABETAI' ( an apple - 1 want to ear)). 

Also, in cases where there is included in the input phrase I a conpound word (for example a compound noun) con- 
sisting of a word used as a modifier among the words of the input phrase I added to a word of the example phrase E 
(for example, when the example phrase E is 'DENWA O MITSUKERLT (lind a telephone") and the input phrase I is 
'KOUSHUUDENWA O MITSUKERU' (lind a public telephone*) and cases where there is included in the input phrase I 
a conpound word consisting of a word used as a nnxfifier among the words of the input phrase I added to a word of the 
exanple phrase E having been replaced with a word of the input phrase I (for example, when the exanple phrase E is 
'DENWA O MITSUKERU' (1ind a telephone*) and the input phrase I is 'KOUSHUUTOIRE O MITSUKERU' (lend a public 
toilet*)) it is necessary to make a word of the example phrase E correspond with a compound word of the input phrase 
I (in the above examples. 'DENWA' and 'KOUSHUUDENWA' (telephone' and "public telephone"). 'DENWA' and 
'KOUSHUUTDIRE* {'telephone* and 'public toilet*)), but this is dfficult to do with the four operators mentioned above. 

To overcome this, instead of the word deleting operator (an operator which irreversbly determines deletion at the 
time of its operation) delete-word{ewm). which actually deletes the word ew^ of the example phrase E. a provisionally 
deleting operator which is an operator for deleting the word ew^, provisionally will be introduced. Because this provision- 
ally deleting operator temporarily renx)ves (stores) the word being provisionally deleted (hereinafter refen-ed to as the 
provisionally deleted word) ew„ to the cache 7A. it will be written cache-as-deleted(ew J. 

Also, instead of the word inserting operator add-word(iWn). which actually inserts the word iw^ of the input phrase 
into the example phrase E (irreversibly determines insertion at that time), a provisionally inserting operator which is an 
operator for inserting that word rwp into the example phrase E provisionally will be introduced. Because this provision- 
ally inserting operator temporarily removes the word iw^ being provisionally inserted (hereinafter referred to as the pro- 
visionally inserted word) to the cache 7A, it will be written cache-as-added(iwn). 

Also, a saannbling operator SCTamble(ew^,iWn), which is an operator for copying unchanged a word ew^ of the 
example phrase E as a word iw^ which among the words of the input phrase I is not the word aligned with the word ew^, 
or replacing the word ew^ with such a word fw^, i.e. changes only the position of a word of the example phrase E or 
replaces a word of the example phrase E with a word of the input phrase I and also changes its position, will be intro- 
duced. 

Also, a compounding operator coapound{iWn.t,iwn) which is an operator for identifying two consecutive words Iw^ 
1 and iwn of the input phrase I as constituting a compound word when those two words are a modifier and a modified 
word will be introduced. In this case, an operator for copying unchanged a word ew^ of the example phrase E as a word 
iwn constituting the phrase input phrase I and making it form a conpound word with the word iwn i one word in front of 
it can be expressed as the combination of the word copying operator echo-word(ewn,.iwn) and the compounding oper- 
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ator compound(iWn.,.rw^) (hereinafter for corwenience referred to as the copying and compounding operator). Also, an 
operator for replacing a word ew,„ of the example phrase E with a word iw^ of the input phrase I and then making it form 
a compound word with the word iw^., one word in front of it can be expressed as the combination of the word replacing 
operator alter -wofd(ew^JwJ and the compounding operator compound(iWn.i.iWn) (hereinafter for convenience re^^rred 
to as the replacing and conpounding operator). 

In the preferred embodiment shown in Fig. 15. as operators operating on the words of the example phrase E, the 
six operators that are the word copying operator echo-word (ew^.iwn). the word replacing operator alter -word(ewni.iwn), 
the provisionally deleting operator cache-as-deleted(ewj. the provisionally inserting operator c'>':he-as-added(iwn). 
the scrambling operator scramble(ewrn.iwn) and the compounding operator compound{iw„.,.iw.,) are employed Of 
these six operators, the compounding operator compound(iw„.,.iwn) only is not used alone and as aescribed above is 
used as the copying and compounding operator or the replacing and compounding operator in combination with the 
word copying operator echo-word(eWn,.rWn) or the wad replacing operator alter-word(ewr„.iw„). 

Of the newly introduced operators, the provisionally deleting operator cache-as<ieleted(ew,n) and the provisionally 
inserting operator cache-as-added{iw„) can be used in the same ways as the word deleting operator delete-word(ew^) 
and the word inserting operator add-word(iWn) respectively (although in the point that they provisionally delete or insert 
words by storing the word in the cache 7A they differ from the word deleting operator delete-word(ew^) and the word 
inserting operator add-word(iWn). which in^eversibly determine deletion or insertion of the word), but in the use of the 
scrambling operator scramble(ew^.iwn) and the copying and compounding operator and the replacing and compound- 
ing operator there are the following kinds of limitation. 

That is. transforming an example phrase E into an input phrase I by changing just the position of a word of the 
example phrase E or replacing a wad of the example phrase E with a word of the input phrase I and then changing its 
position can be realized by either deleting the word whose position is to be changed and then inserting it in a predeter- 
mined position or by inserting the word whose position is to be changed in the predetermined position and then deleting 
the wad in the original position. 

Therefore, when transforming an example phrase E into an input phrase I, to change the position of a word it is nec- 
essary for that wad to have been deleted from the example phrase E or inserted into the input phrase I in the past. 
Because of this, the scrambling operator scranr*(e(ew^.iWn) is only used when a word ew^ of the example phrase E 
has been provisionally deleted or when a word iwp of the input phrase I has been provisionally inserted, ie. when a pro- 
visionally deleted word ew^ or a provisionally inserted word iw^ has been stored in the cache 7A. 

Specifically, in cateulating an optimum path, in a case where the transfamation path from the nrnh word ew^ of the 
example phrase E to the nth wad iw^ of the input phrase I is a problem, when a provisionally deleted word ewj^ (k<m) 
is stored in the cache 7A. it is possible to use the saamWing operator scrarT*3le(ew^.iWn) (the nth word iw^ of the input 
phrase I can be deemed to have been made not by transforming the mth word ew^, of the example phrase E but by a 
transfanr^tion changing the position of the klh word ew^ of the example phrase E, without changing the word or after 
replacing the word). 

Also, in the case mentioned above, also when a provisionally inserted word iw, (s<n) is stored in the cache 7A. it 
is possible to use the scrambling operator scrambleCew^.iWa) (the mth wad ew^, of the example phrase E can be 
deemed not to have been transformed into the nth word iw„ of the input phrase I but to have been transformed into the 
sth wad iwg of the input phrase I. either without being changed a after being replaced, by having its position changed). 

When the scrambling operator Scramble(ewk.iWn) or the scrambling operator scramble(ew^.fWe) has been used 
(when in the process of calculating the optimum path a transfamation path corresponding to one of these has been 
selected), because at that time it is finally determined that the provisionally deleted word ew^ or the provisionally 
inserted wad iw, has not been deleted a inserted, that is, it is finally detern^ned that its position has been changed, it 
is aased from the cache 7A. 

Next concerning the copying and compounding operator and the replacing and compounding operator, there are 
the following kinds of limitation on their use. 

That is, to copy without changing it a wad ew^ of the example phrase E as a word iwp of the input phrase I and 
make it form a conrpound word with the word iw^.i one word in front of it. or to replace a word ew^n of the example 
phrase E with a word iwp of the input phrase I and make it fam a compound word with the word iwo., one word in front 
of it it is necessary tor the word iw^.^ serving as a modifier to have in the past been inserted into the input phrase I. 
Consequently, the copying and compounding operator and the replacing and conpounding operator are used only 
when a word iw^., of the input phrase I has been provisionally inserted in the past, i.e. when the provisionally inserted 
word iw^.^ is stored in the cache 7A. 

Specifically, in cak:ulr*tng the optimum path, when the transformation path from the mth word ew^^ of the example 
phrase E to the nth word iw^ of the input phrase I is a problem, the copying and compounding operator and the rpplacing 
and compounding operator can be used when the provisiorwlly inserted word iw^.^ has been staed in the cache 7A. 
i.e. when the word iwn.^ has been provisionally inserted in front of the word iw^. 

When the copying and compounding operata or the replacing and compounding operator has been used, because 
at that point in time it is finally determined that the provisionally inserted wad iw^., has not been inserted, i.e. it is finally 
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determined that it constitutes a compound word, it is erased from the cache 7A. 

Next, to obtain the word transformation probabilities of when transformation of words is carried out using these six 
transforming operators (the word copying operator echo-word(ew^.iw„). the word replacing operator alter- 
word(ew^.iwJ. the provisionally deleting operator cache -as-deleted(ew^). the provisionally inserting operator cache- 
as-added{iwj. the scrambling operator scramble(ewn,.iwn) and the compounding operata compound(iWn,i.iWri)). prob- 
abilities of the transforming operators (probabilities of transformations using the transforming operators being carried 
out) are necessary, and these can be defined I'or example as follows. 

That is. first, the probability (the word copying probability) P(echo-word(ew^.iw„)) for the wad copying operator 
echo-word(ew^.iwJ and the probability (the wo^d replacement probability) P(alter-word(ew^.iw„)) for the word replac- 
ing operator alter-word(ew„,.iw„) are given by Exp. (22) and Exp. (25) respectively, as in the preferred embodiment 
described above. 

Next, the probability (hereinafter for convenience referred to as the provisional deletion probability) P(cache-as- 
deleted(ew^)) for the provisionally deieting operator cache-as<leleted(ewj is given for example using the word dele- 
Iton probability P(delete-word(ew J) and the word copy probability P(echo-wofd(ew^.iw J) so as to satisfy the following 
expression. 



P(delete-word(ew^))<P(cache-as-deleted(ew^))<P(echo-word(ew^.iw„)) (32) 

Here, the provisional deletion probability P(cache-as-deleted(ew^)) has a value satisfying Exp. (32) and for exam- 
ple lower than the word replacement probability P(alter-word{ew„.iwJ) for a similar word and higher than the word 
replacement probability P(alter-word(ewm.iWn)) for a dissimilar word. 

The probability for the provisionally inserting operator cache-as-added(iwn) (hereinafter for convenience referred to 
as the provisional insertion probability) P (cache-as-added (iwj) is given for exarrple using the word insertion probability 
P(add-word(iWn)) and the word copy probability P(echo-word(ew„,.iWn)) so as to satisfy the following expression. 

P(add-word(iw „)) <P(cache-as-added(iw n))<P(echo-word(ew ^-'w „)) (33) 

Here, the provisional insertion probability P(cache-as-add9d(rw J) has a value satisfying Exp. (33) and for example 
lower than the word replacement probability P(alter-word(ewrT».iWn)) fa a similar word and higher than the word replace- 
ment probability P(alier-word(eWn,,lwJ) for a dissimilar word. 

Also, when the word ew^ is the same as the word iw^. the word copy probability P(echcHword(ew^.iWn)) in Exps, 
(32) and (33) is given by Exp, (22) (when the words ew^ and iw^ are not the same, because as mentioned above the 
word copy probability P(echo-word(ew^.iwJ) becomes 0. in this case, the upper limrt of the provisional deletion proba- 
bility P(cache-as-deleted(ew^)) and the provisional insertion probability P(cache-as-added(twJ) is not limited by the 
word copy probability P(echo-wword(ew^,iwn))). 

The probability P(scrarTtile(ew^.iwJ) (hereinafter for convenience referred to as the saamble probability) for the 
Ecrarrtsling operator scramble(ewn,.iwn) can be given by the following expressions. 



P(scramble(ew^,rw„)) = f(P(alter-word(ew„.iw J).iw„) when ew^ ^ iw„ (35) 
P(scrant)le(ew^;iw„)) = f{P(echo-word{ew^.iw„)).iwJ when ew^ = iw„ 

In Exp, (35), the function fQ obtains the likelihood of occurrence of a phenomenon (saamWing) of the order of 
words or phrases and so on changing, and can be set taking into account for example the nature of the word stored in 
the cache 7A (for example its part of speech), the language of the input phrase I and the situation in which the input 
phrase I is being used. That is, the function fQ can be set so as to give a higher probability with respect to input phrases 
I of Japanese, in which word order changes are freer than in English and the like, than to input phrases I of Eriglish, The 
function fQ can also be set so as to give a higher probability with respect to spoken language, in which word order 
changes such as Inversions are more likely to occur than in written language. 

In Exp. (35), as the argument of the function fQ. when ew^^iw^ the word replacement probability P(a!ter- 
word(ew^.iw J) is used; this is because when the words ew^ and iw^ are not the same (when ew ^*iw „ ), It can be con- 
sidered that the word of the exanrple phrase E has been replaced and the word order has been changed, and when the 
words ew^ and iw^ are the same (when ew^^lw^ ) it can be considered that the word of the example phrase E has 
been copied without being changed and only the word order has been changed. 

In Exp. (35), in both the cases of ew ^^'m „ and ew ^=iw „ . as the argument of the function f(). the wad iw^ of the 
input phrase I is used; this is because it can be supposed that the likelihood of a wcrd order change varies depending 
on the word iw^. That is, for example in J^nese there are almost no cases where a verb is moved to the beginning, 
and therefore when the word iw^ is a verb It takes a value such that the scramble probability P(scrani)le{ewm.iWn)) is 
small. 
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Next, the probability (hereinafter for convenience referred to as the compound probability) P(compound(iWn.,.iWn)) 
is for example given by the following expression. 

P(compound(iw ^ , .iw )=g(iw „. ^ .iw (36) 

In Exp. (36). the function g(iwp.i.iwn) expresses the probability of the consecutive words iw„.^ and iw„ forming a 
compound word, and this can be calculated for example in the following way. That is. it is possible for example to 
prestore numerous examples compound words and find the compound word most similar to the combination of the 
words iwn.,. iwn and normalize its degree of similarity to obtain a probability of them forming a compound word. 

When the degree of similarity between the combination and the stored compound word example is large enough, 
the compound probability P(compound(iWn.i,iWn)) is higher than the word insertion probability P(add-word(iWn)). 

The probability (hereinafter for convenierxe referred to as the copy and compound probability) of a word ew^ of the 
example phrase E being copied as a word iw^ of the input phrase I without being changed and forming a corrpound 
word together with the word in front of it is the product of the word copy probability P(echo-wof dlew^^jw^)) and the com- 
pound probability P(compound(iWn.,.iw„)). and the probability (hereinafter for convenience referred to as the replace 
and compound probability) of a word ew^ of the example phrase E being replaced with a word iw^ of the input phrase 
I and forming a compound word together with the word in front of it is the product of the word replacement probability 
P(alter-word(ew^.iWn)) and the compound probability P(compound(iWn.i.iwJ). 

With the six transforming operators of the word copying operator echo-word(ewni.iwn). the word replacing operator 
after-wofd{ewm.iWn). the provisionally deleting operator cache-as-deleted (ewm). the provisionally inserting operator 
cache-as-added(iwn). the scrambling operator scramble(ew^.iwn) and the compounding operator compound(iw^^.iwn) 
it is possible to carry out the following eight kinds of transformation. 

That is. firstly, with the word copying operator echo-word (ew^.iwn). it is possible to make the word ew^ of the exam- 
ple phrase E the word iw^ of the input phrase I without changing it. Secondly, with the word replacing operator alter- 
word(ewrT^.iWn). it is possible to replace the word ew^, of the example phrase E with the word iw^ of the input phrase I. 
Thirdly, with the provisionally deleting operator cache-as-deleted{ew^), it is possible to provisionally delete the word 
ew^ of the example phrase E, and fourthly, with the provisionally inserting operator cache-as-added(iWn), it is possible 
to provisionally insert the word iw^ of the input phrase I. 

Also, with the scrambling operator 6crannble(ew^.iWn). it is possible to can-y out the following two word order 
changes. That is. fifthly, with the scrambling operator Scramble(Gwj^,iwJ, it is possible to change the word order by 
inserting a provisionally deleted word ow^ stored in the cache 7A as the nth word iw^ of the input phrase I, and sixthly, 
with the SCTamWing operator scramble(ew^,iwj. it is possible to make a provistonally inserted word iw^ a change In the 
word order of the mth word ew^ of the example phrase E. 

Seventhly, with the combination (copying and compounding operator) of the word copying operator echo- 
wond(ew^,iwJ and the compounding operator conpound(iwn.i.iwn). ft is possitDle without changing it to copy the word 
ew^ of the example phrase E as the word ew^ of the input phrase I and make it form a compound word with the word 
iwn-i one word in front of it. and eighthly, with the combination (replacing and conopounding operator) of the word replac- 
ing operator a]ter-word(ew^.iWn) and the compounding operator conpound(iwn.i.iwn). it is possible to replace the word 
ew^ of the example phrase E with the word iwp of the input phrase I and make it form a compound word with the word 
rwn.1 one word in front of it. 

Considering finding the optimum set of transforming operators able to carry out these eight Uansformations, i.e. the 
optimum path, by means of DP matching, this can be done for example by a cateulating cumulative distarKe G(m,n) 
using weightings of paths corresponding to the eight transformations shown in Exp. (37), similar to the case in Exp. (31). 
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G(nri. n)=min^ 



w 



15 



G(m-1 . n-1) - logP(echo-word(eWm,iWn)) 
G{rn-1 . n-1) - logP(alter-word(ewm,iwn)) 
G(m-1, n)" logP(cache-a3-delGted(GWm)) 
G(m, n-1)- logP(cache-as-added(iwn)) 
G(m, n-1)- logP(scramble(ewk.iwn)) 
G(m-1. n)- logP(scramble{eWfT,,iWs)> 
G{m-1, n-1) - logP(GCho-word(ewm.iWn)) 

- logP{compound(iwn--i,iWn)) 
G(m-1. n-1) - logP(alter-word(ev/m,iwn)) 

- logP(compound(iwn-i,iWn}) 



(37) 



In the preferred embodiment showri in Fig. 15. in the converting part 7. by calculating Exp. (37) until m=M . using 
20 the cache 7A as necessary, an optimum path is found and a word transformation probability is calculated. 

That is. as shown in the flow chart of Fig. 16. in the converting part 7, first, in a step S61 . according to Exp. (37). 
the cumulative distance G(m.n) of when the word ew^ constituting a focus example phrase E is made to correspond 
with the word iw^ of the input phrase I is found. 

Then, proceeding to a step S62. it is determined whether or not in the calculation of the cumulative distance G{m.n) 
25 in the step S61 the path corresponding to the provisionally deleting operator cache-as-deleted(ewm). i.e. the third 
expression from the top on the right hand side in Exp. (37) was selected, and when it is determined that it was selected 
processing proceeds to a step S63. In the step S63. the word ew^ provisionally deleted by the provisionally deleting 
operator cache-as-det6ted(ewnj is removed to (stored in) the cache 7 A and processing proceeds to a step S64. 

When in the step S62 rt is determined that the path corresponding to the provisionally deleting operator cache-as- 
30 deleted(ew^) was not selected, the step S63 is skipped and processing proceeds to the step S64, wherein it is deter- 
mined whether or not in the calculation of the cumulative distance G(n),n) the path corresponding to the provisionally 
inserting operator cache-as-added(iWn), i.e. the fourth expression from the top on the right hand side in Exp. (37). was 
selected. When in the step S64 it is determined that the path corresponding to the provisionally inserting operator 
cache-as-added(iwn) was selected, processing proceeds to a step S65 and the word ew^, provisionally inserted by the 
35 provisionally inserting operator cache -as-added(iwn) is rerrxwed to the cache 7A, and processing proceeds to a step 
S66. 

When in the step S64 it is determined that the path corresponcfing to the provisionally inserting operator cache-as- 
added(iwj was not selected, processing skips the step S65 and proceeds to a step S66. wherein it is determined 
whether ornot in the calculation of the cumulative distance G(m.n) a path con-esponding to the copying and compound- 
ing operator or the replacing and compounding operator, i.e. the expression on the seventh line or the eighth line from 
the top on the right side in Exp. (37), was selected. When in the step S66 it is determined that a path corresponding to 
the copying and compounding operator or the replacing and compounding operator was selected, processing proceeds 
to a step S67. and because a provisionally deleted word iw^.^ made a modifier constituting a compound word by the 
copying and compounding operator or the replacing and conpounding operator is stored in the cache 7A this is erased 

4S and processing proceeds to the step S68. 

When in the step S66 it is determined that neither the paths corresponding to the copying and compounding 
operator or the replacing and compounding operator was selected, processing skips the step S67 and proceeds to the 
step S68 and it is determined whether or not in the calculation of the cumulative disUnce G(m,n} a path corresponding 
to the scrambling operator scramble(ew^.iwn) or the scrambling operator scrant)le(ew^.iWs). i.e. the expression on the 

50 fifth or the sixth line from the top on the right side in Exp. (37) was selected. When in the step S68 it is determined that 
a path corresponding to the scrambling operator scramble(ewj^,iwn) or the scrambling operator scramble(ew^,iwj was 
selected, processing proceeds to a step S69 wherein because a provisionally deleted word ew^ or a provisionally 
inserted word iw^ is stored in the cache 7A this is erased, and then proceeds to a step S70. 

When on the other hand in the step S68 it is determined that neither of the paths corresponding to the sciamt)Iing 

55 operators scramble(ew,^,iwn) or scramble{ewrn,iw5) were selected, processing skips the step S69 and proceeds to the 
step S70. wherein it is determined whether or not m is less than M (the number of words constituting the focus example 
phrase E). When it is determined that m is less than M, processing returns to the step S61 and the converting part 7 
increases m and n and repeats the subsequent processing. When in the step S70 it is determined that m is less than 
i.e. m is equal to M. processing proceeds to a step S71 and it is determined whether or not a provisionally deleted 
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word eWfj, or a provisionally inserted word iw^ is stored in the cache 7A. 

When in the step S71 it is determined that a provisionally deleted word ew^, or a provisionally inserted word iw^ is 
stored in the cache 7A, processing proceeds to a step S72 and, among the paths of when the cumulative distance 
G(M.n) was obtained, the path corresponding to the provisionally deleting operator cache-c'.i-deleted(ew^) having pro- 

5 visionally deleted the provisionally deleted word ew^ stored in the cache 7A or the path corresponding to the provision- 
ally inserting operator cache-as-added(iwn) having provisionally inserted the provisionally inserted word iwp stored in 
the cache 7A is respectively converted into the path con-esponding to a word deleting operator delete -word(ew^) or a 
path corresponding to the word inserting operator add-word{iWn) arxJ the path obtained as a result is taken as the opti- 
mum path, and processing proceeds to a step S73. 

w When in the step S71 it is determined that neither a provisionally deleted word ew,n nor a provisionally inserted 
word iw„ is stored in the cache 7A, the path of when the cumulative distance G(M,n) was obtained is taken unchanged 
as the optimum path, processing proceeds to the step S73 and the word transformation probability is calculated follow- 
ing this optimum path. 

In this case, the optimum path includes a path corresponding to the provisionally deleting operator cache-as- 
ts deleted(ew^) or the provisionally inserting operator cache-as-addedliw^) respectively having provisionally deleted or 
provisionally inserted the provisionally deleted word ew^ or the provisionally inserted word iw„ erased from the cache 
7A. but this path can be ignored in calculating the word transformation probability. 

As a result of the scrambling operator scramble(ewrp.iWn) being introduced together with the provisionally deleting 
operator cache-as-deleted(ewm) and the provisionally inserting operator cache-as-added(iwn). as described above. 
20 even when there is a difference in word order between an input phrase I and an example phrase E which are similar, it 
becomes possible to determine their similarity correctly. 

Also, as a result of the compounding operator compound(iwn.i,iwn) being introduced, it becomes possible to handle 
compound words as well, 

A calculating method using memory means like the cache 7A for temporarily storing provisionally deleted words 
25 and provisionally inserted words is applied to language models for voice recognition in for exanple "A Cache-based 
Natural Language Model for Speech Recognition". Roland Kuhn and Renato De Mori. IEEE Transactions on Pattern 
Analysis and Machine Intelligence, vol.12. No.6, June 1990. 

Here, a language model based on the hypothesis that the probability of a recently-used word appearing (being spo- 
ken) again is higher than the probability of that word first appearing is proposed, wherein recently-used words are 
30 Stored in menxxy means and the probability of an ith word Wj being a predetermined word W is calculated with refer- 
ence to this memory means. 

That is. for exanple in an ordinary 3g-gram nrxxJel. when the parts of speech of the i-1th and i-2th words are 
respectively written gj., and the probability P(W,=W|g,.2.g,.i) of the ith word Wj being a predetermined word W is 
cak:ulated for example using the following expression. 

35 

P(Wi=W|g,2.gM)= £ P(WrW|g,=gpP(gj=gj|g,2.g,,) (38) 

40 

Here. G is a set of parts of speech in the language that is the object of the voice recognition. 

In the aforementioned literature, a method of calculating the P(W j=W|g ,=g j) on the right side of Exp. (38) accord- 
ing to the following Exp. (39) by storing recently-used words in storing means is proposed. 

^ P(W(.W|g,-gp=kMj>cf(W,.Wlg,-gp+k,jxCj{W.i) (39) 

Here. k„j44Cc.j - 1 ard Kmj and kcj are both variables indexed only by j. 

The Cj(W.i) on the right side of Exp. (39) is a probability cak:ulated by referring to the memory means, and changes 
depending on whether the word W has appeared recently. 

so Tlierefore, the means in the preferred enr^bodiment shown in Fig. 15 has something in common with the means of 
the aforementioned literature, wherein words are stored in memory means and probabilities are calculated with refer- 
ence to that, in the point that words are stored in the cache 7A and probabilities are calculated with reference to this; 
however, it differs from the means of the aforementioned literature, wherein words having appeared recently are stored 
in the storing nneans. in that only provisionally deleted words and provisionally inserted words are stored in the cache 

55 7A. 

With the compounding operator compound(rWrt.^ .iw^. as well as it being possble to hande compound nouns of the 
kind mentioned above, it is possble for example to handle such times as when the example phrase E is 'INU WA 
KAWAir (*dogs are cute') and the input phrase I is WATASHI NO INU WA KAWAir ('my dog is cute*) and a conpound 
word 'WATASHI NO INU' ('my dog') made up of the modifier 'WATASHI NO' ('my') added to the word 'INU' ('dog') of the 
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input phrase I corresponds to the word MNU" of the example phrase E. However, in cases such as *WATASHI NO INU* 
( my dog') wherein it is clear that WASHI NO' Cmy-) modifies 'INU* Cdog*). by applying the replacing operator alter- 
wofdCINU'/WATASHI NO INU') to the 'INU' of the example phrase E and the *WATASHI NO INU* of the input phrase I 
and using a noun phrase example 'WATASHI NO X' ('my X') (where X represents a noun), it is possible to process the 
phrase correctly and more efficiently than when the compounding operator compound(iw„.,.iw„) is used. 

That is. the compounding operator compound(iw„.t.iwJ is effective in cases such as when a compound word of 
which the modifying relationship is unclear, for example one not containing the (possessive) particle *N0' (for example 
a compound noun) is inputted into the inputting part 1. or when in a voice recognition device in the inputting part 1 a 
partide is not recognized and for example 'WATASHl. INU WA KAWAir is outputted from the inputting part 1 with 
respect to an actual input of 'WATASHl NO INU WA KAWAir. 

In the above, a case wherein the invention is applied to a translating apparatus for carrying out translation from Jap- 
anese into English was described, but the invention can also be applied to translating apparatuses for performing trans- 
lation between other languages. 

It is possible to provide the translating apparatus with buttons for correcting and confirming translation results out- 
putted frorn the outputting part 16 (hereinafter for convenience referred to respectively as the correction button and the 
confirmation button). The translating apparatus can be made to carry out subsequent processing when the confirmation 
button is operated confirming that the translation result is correct. Also, the translating apparatus can be made to carry 
out translation processing again when the correction button is operated indicating that the translation result is wrong. 
That is. for example, the translating apparatus can be made to detect the next highest translation suitability and carry 
out translation on the basis of that example. Furthermore, it is also possible to provide the translating apparatus with a 
control part for correcting translation results outputted from the outputting part 16. In this case, when translation results 
have been conrected by the control part being operated, it is possible to stae exanrples corresponding to the results of 
this con-ection in the noun phrase translation example memory 8. the verb phrase translation exarrple memory 9 or the 
other translation example memory 1 0. 

Although in the preferred embodiments desaibed above a priori probability P(Ej) was found for each example Ej for 
example according to Exp. (5) or as desaibed above with reference to 1 1. for this priori probability P(Ei) it is also pos- 
sible to use the same value, for example the inverse 1/Ng of the number of examples Ng. for all the examples Ej. 

In these preferred embodiments. In the similarity degree calculating part 1 1 , the word replacement probability was 
found on the basis of the number of words included in a concept common to the two words concerned, but it Is also pos- 
sible for the word replacement probability to be found on the basis of the Iwel of the class to which the concept common 
to the two words belongs in the thesaurus, as in the related art. That is. when the number of classes of the thesaurus 
is written C and the level of the dass to which the concept common to a word iw„ of an input phrase I and a word ew^ 
of an example phrase E belongs is written L. it is possible to find the word replacement probability P(alter- 
word(ew^.iw J) according to the following expression. 

P{alter-word{ew ^.iwj)— 1^ (40) 



Also, although in the preferred enr^bodiments the word deletion probability and so on were determined on the basis 
of the part of speech of the word (whether the word is an independent word or a functional word, and so on), depending 
on the domain in which the translating apparatus is to be used there are cases in which it is effective to determine them 
on the basis of information other than the part of speech. That is. it is possible to determine the word deletion probability 
and so on taking into consideration factors such as that in spoken language pronouns of the first and second person 
and so on tend to be omitted and that words conveying a topic in context tend to be omitted because the topic is dear 
to the people talking. 

Also, it is possible to give the word insertion probability and the word deletion probability a predetermined constant 
value to apply in all cases, as shown in Exps. (41) and (42) respectively. 

P(add-word(iw „))-add-word (41 ) 

P(deIete-word(ew ^))=delete-word (42) 

The word deletion probability and so on can also be given according to a predetermined calculation formula. 

Although in the preferred embodiments translation was carried out in clause or phrase units, it is also possible for 
translation to be carried out in other syntax units. It is also possible for translation to be carried out without the first lan- 
guage sentence bang divided into such syntax units. Furthernrx>re, although in the preferred embodiments descrtoed 
above the example phrases were stored classified into noun phrases, verb phrases and other phrases, the method of 
classification is not limited to this. 
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As described above, in a translating apparatus and a translating method according to the invention, a first language 
sentence is divided into syntax units consisting of predetermined units of sentence structure and examples similar to 
these syntax units of the first language sentence are detected. Then, on the basis of these detected examples the first 
language sentence is translated in syntax units and the results of translation of the syntax units are compounded to 
generate a second language sentence. Therefore, efficient translation processing becomes possible and as a result the 
translation processing speed can be increased. 

In another translating method according to the invention, the first language sentence is divided into syntax units 
consisting of predetermined units of sentence structure, in stages from large syntax units into small syntax units, and 
the syntax units obtained at each stage are collated with examples, and examples corresponding to the syntax units are 
detected on the basis of the results of this collation. The first language sentence is then translated in syntax units on 
the basis of translations corresponding to these detected examples, and the results of translation of the syntax units are 
compounded to generate a second language sentence. -Rierefcre, efficient translation processing becomes possible 
and as a result the translation processing speed can be increased. 

In another translating apparatus and translating method according to the invention, a degree of similarity express- 
ing the similarity between a woid of a first language sentence and a word of an exarnple is calculated on the basis of 
the number of words induded in a concept common to the word of the first language sentence arxi the word of the 
example in a thesaurus wherein words are classified in a tree structure on the basis of their concepts. Therefore, it 
becomes possible to calculate an accurate degree of similarity irrespectVe of the structure of the thesaurus and as a 
result the translation processing speed can be increased. 

In another translating apparatus and another translating method according to the invention, an example similar to 
a first language sentence is detected on the basis of a priori probability of the exanv>te being used to translate the first 
language sentence and a transformation probability of the example being transformed into the first language sentence 
and used, and the first language sentence is translated on the basis of a translation corresponding to the detected 
example. Therefore, it is possible to perform translation processing which is suitable in terms of probability. 

In another translating apparatus and translating method according to the invention, by syntax analysis being per- 
formed on a first language sentence, grammatical attributes of the sentence are extracted and an example similar to the 
first language sentence is detected taking into account grammatical attributes of the first language sentence and of the 
exarrpla The first language sentence is then translated on the basis of a translation corresponding to the detected 
example. Therefore, it becomes unnecessary to store muttipio superficially different exanples having the same mean- 
ing content. 

Various other modifications and application examples are of the Invention are conceivable within the scope of the 
invention. Therefore, the invention is not limited to the preferred enrtxxjiments described above. 

Claims 



1. A translating apparatus for trarslating a first language sentence expressed in a first language into a secondlan- 
guage sentence expressed in a second language, comprising: 

example menx)ry means (3) storing examples expressed in said first language arxi corresponding translations 
of said exanrples in said second language; 

dividing means (4) for dividing said first language sentence into syntax units consisting of predetermined units 
of sentence structure; 

detecting means (6) for detecting for each of said syntax units of said first language sentence outputted by said 
dividing means an exarrple similar to that syntax unit from among said exanples; 

translating means (7) for translating said first language sentence in said syntax units on the basis of those of 
said translations conresponding to those of said examples detected by said detecting means (6); and 
generating means (1 5) for generating said second language sentence by compounding results of translation of 
said syntax units obtained by said translating means. 

2. A translating apparatus according to claim 1 . wherein said example memory means (3) stores said examples and 
said corresponding translations in said syntax units. 

3. A translating apparatus according to claim 1 or 2. wherein said dividing means (4) divides said first language sen- 
tence in stages from large syntax units to small syntaxunits. 

4. A translating apparatus according to claim 3. wherein said dividing means (4) comprises clause dividing means for 
dividing said first language sentence into dause units and phrase dividing means for dividing said dause units of 
said first language sentence outputted from said clause dividing means into phrase units. 
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5. A translating apparatus according to claim 4. wherein said example memory means (3) comprises clause unit 
example memory means (5) storing examples in clause units and 

phrase unit example memory means storing phrase examples in phrase units. 

6. A translating apparatus according to daim 5. wherein said phrase unit example memory means (3) stores said 
examples classified at least into noun phrases and vert phrases. 

7. A translating apparatus according to claim 3. wherein said dividing means (4) divides said first language sentenc j 
into large syntax units and when none of said examples matches any of said large syntax units divides said large 
syntax units into smaller syntax units. 

8. A translating apparatus according to claim 1 . wherein said example memory means (3) stores said examples 
together with grammatical attributes thereof. 

9. A translating apparatus according to claim 8, wherein said dividing means (4) divides said first language sentence 
into said syntax units and extracts grammatical attributes of said syntax units by carrying out syntax analysis and 
said detecting means (6) detects those of said examples similar to said clause units taking into account grammat- 
ical attributes of said syntax units and said examples. 

1 0. A translating apparatus according to anyone of the preceding claims, wherein said detecting means (6) detects said 
example similar to said syntax unit on the basis of a priori probability of said example being used to translate said 
syntax unit and a transformation probability of said example being transformed into said syntax unit and used. 

11. A translating apparatus according to claim 10. wherein said transfonnation probability is calculated from a gram- 
matical attribute transformation probability of grammatical attrtoutes of said example being transformed and a word 
transformation probability of words of said example being transformed. 

12. A translating apparatus according to claim 11, wherein said grammatical attribute transformation probability Is cal- 
culated from a grammatical attribute copy probability of a grammaticaJ attrtoute of said example being used 
unchanged, a grammatical attribute deletion probstoility of a grammatical attribute of said example being deleted, a 
grammatical attribute addition probability of a graovnatical attribute of said syntax unit being added, and a gram- 
matical attribute replacement probability of a grammatical attribute of said exannple being replaced with a grammat- 
ical attribute of said syntax unit. 

13. A translating apparatus according to claim 11, wherein said word transformation probability is calculated from a 
word copy prot>ability of a word of said example being used unchanged, a word deletion probability of a word of 
said exanrple being deleted, a word addition probability of a word of said syntax unit being added, and a word 
replacement probability of a word of said example being replaced with a word of said syntax unit. 

14. A translating apparatus according to daim 13. further comprising: 

thesaurus memory means (12) storing a thesaurus wherein words are classified in a tree structure on the basis 
of their concepts; and 

word similarity degree calculating means (11) for calculating a word similarity degree expressing a similarity 
between a word of said syntax unit and a word of said example on the basis of said thesaurus (12) and output- 
ting said word similarity degree as said word replacement probability. 

15. A trarKlating apparatus according to claim 14. wherein said word similarity degree calculating means (1 1) calcu- 
lates said word similarity degree on the basis of the number of words included In a concept common to said word 
of said syntax unit and said word of said example in said thesaurus. 

16. A translating apparatus according to claim 1 1. wherein said wad transformation probability Is calculated according 
to an optimum transformation path for transforming said example into said syntax unit. 

17. A translating apparatus according to claim 16. wherein said optinujm transformation path is found by dynamic pro- 
gramming. 

18. A translating apparatus according to daim 13, wherein said word ti-ansformation probability is calculatedfurther 
using at least one among a wad scramble probability of a wad of said example unchanged or replaced with a word 
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of said syntax unit being used with its position changed, a word copy and compound probability of a word of said 
example having a modifier added thereto and being used as a compound word, and a word replace and conpound 
probability of a word of said example replaced with a word of said syntax unit having a modifier added thereto and 
being used as a compound word. 

1 9. A translating apparatus according to claim 18, wherein said word transformation probability is calculated according 
to an optimum transformation path t)r transforming said example into said syntax unit. 

20. A translating apparatus according to claim 19. wherein in finding said optimum transformation path instead nf an 
actual deletion path corresponding to actually deleting a word of said example a provisional deletion path provision- 
ally deleting a word of said example is used and instead of an actual addition path corresponding to actually adding 
a word of said syntax unit a provisional addition path provisionally adding a word of said syntax unit Is used, 

further comprising word memory means for storing a provisionally deleted word and a provisionally added 
word provisionally deleted or added accx)fding to said provisional deletion path or said provisional addition path. 

21. A translating apparatus according to claim 20. wherein a scramble path corresponding to using a word of said 
example unchanged or replaced with a word of said syntax unit with its position changed is selected only when 
either a word of said example has been stored in said wad memory means as said provisionally deleted word or a 
word of said syntax unit has been stored in said word memay means as said provisionally added word, and there- 
after said word of said example stored as said provisionally deleted word or said word of said syntax unit stored as 
said provisionally added word is erased from said word memory means, and 

a copy and compound path corresponding to a word of said example having a modifier added thereto and 
being used as a compound word and a replace and compound path corresponding to a word of said example 
replaced with a word of said syntax unit having a nxxjifier added to it and being used as a compound word are 
selected only when a word to become said modifier has been stored in said word memory means as said provision- 
ally added word, and thereafter said word to become a modifier stored as said provisionally added word is erased 
from said word merrxjry means. 

22. A translating apparatus according to daim 21. wherein after the end point of a transformation path transforming 
said example Into said syntax unit is reached a provisional deletion path or a provisional adding path provisionally 
deleting or adding said provisionally deleted word or said provisionally added word stored in said word memory 
means is converted into an actual deleting path or an actual adding path respectively and the path obtained is 
made said optimum transformation path. 

23. A translating apparatus according to dalm 10, wheran said priori probability or said transformation probability is 
calculated on the basis of corpus data. 

24. A translating apparatus according to claim 10, wherein said priori probability or said transformation probability is 
calculated on the basis of said examples stored in said example memory means (3). 

25. A translating method for translating a first language sentence expressed in a first language into a second language 
sentence expressed in a second language using example memory means (3) storing examples expressed in said 
first language and corresponding translations of said examples in said second language, comprising the steps of: 

dividing (4) said first language sentence into syntax units consisting of predetermined units of sentence struc- 
ture; 

detecting (6) for each of said syntax units of said first language sentence an example similar to that syntax unit 
from among said examples: 

translating (7) said first language sentence in said syntax units on the basis of those of said translations cone- 
sponding to the detected examples; and 

generating (15) said second language sentence by compounding results of translation of said syntax units. 

26. A translating method for translating a first language sentence expressed in a first language into a second language 
sentence expressec* ^n a second language using example memory means (3,5) storing examples expressed in said 
first language and corresponding translations of said examples in said second language, comprising the steps of; 

dividing (4) said first language sentence into syntax units consisting of predetermined units of sentence struc- 
ture in stages from large syntax units into small syntax units and collating syntax units obtained at each stage 
with said examples; 



31 



EP 0 805 403 A2 



detecting (6) for each of said syntax units an example corresponding to that syntax unit from among said exam- 
ples on the basis of results of the collation; 

translating (7) said first language sentence in said syntax units on the basis of those of said translations cone- 
spofxiing to the detected examples; and 

generating (15) said second language sentence by compounding results of translation of said syntax units. 

27. A translating apparatus for translating a first language sentence expressed in a first language into a second lan- 
guage sentence expreF<=ed in a second language, comprising: 

example memory means (3.5) storing examples expressed in said first language and corresponding transla- 
tior^s of said examples in said second language; 

thesaurus memory means (12) storing a thesaurus wherein words are classified in a tree structure on the basis 
of their concepts: and 

calculating means (11) for calculating word similarity degrees each expressing the similarity between a word 
of said first language sentence and a word of one of said examples on the basis of said thesaurus (12); 
detecting means (6) for detecting from annong said examples an example similar to said first language sen- 
tence on the basis of said word similarity degrees calculated by said calculating means; and 
translating means (7) for translating said first language sentence on the basis of a translation among said 
translations corresponding to said example detected by said detecting means. 

wherein said calculating means (11) calculates each of said word similarity degrees on the basis of the 
number of words included in a concept common to a word of 

said first language sentence and a word of one of said examples in said thesaurus. 

28. A translating method for translating a first language sentence expressed in a first language into a second language 
sentence expressed in a second language using exanple memory means (3.5) storing exanrples expressed in said 
first language and corresponding translations of said examples In said second language, comprising the steps of: 

calculating (11) word similarity degrees each expressing the similarity between a word of said first language 
sentence and a word of one of said examples on the basis of a thesaurus (12) wherein words are classified In 
a tree structure on the basis of their concepts; 

detecting (6) from among said examples an exanple similar to said first language sentence on the basis of said 
word similarity degrees; arxl 

translating (7) said first language sentence on the basis of a translation among said translations corresponding 
to said exanple detected; 

wherein each of said word similarity degrees is calculated on the basis of the number of words included 
in a concept common to a word of said first language sentence and a word of one of said exanples in said the- 
saurus (12). 

29. A translating apparatus for translating a first language sentence expressed in a first language into a second lan- 
guage sentence expressed In a second language, comprising: 

example memory means (3.5) storing examples expressed In said first language and corresponding transla- 
tions of said examples in said second language; 

detecting means (6) for detecting from anxsng said examples an example similar to said first language serv 
tence on the basis of a priori probability of said example being used for translating said first language sentence 
and a transformation probability of said example being transformed Into said first language sentence and used; 
and 

translating means (7) for translating said first language sentence on the basis of a translation anx)ng said 
translations corresponding to said example delected by said detecting means (6). 

30. A translating apparatus according to claim 29. wherein said transformation probability is calculated from a gram- 
matical attribute transformation probability of grammatical attributes of said example being transformed and a word 
transformation probability of words of said example being transformed. 

31. A franslating apparatus according to claim 30. wherein said grammatical attribute transformation probability is cal- 
culated from a grammatical attribute copy probability of a grammatical attribute of said example being used 
unchanged, a grammatical attribute deletion probability of a grammatical attribute of said example being deleted, a 
grammatical attribute addition probability of a grammatical attrbute of said first language sentence being added. 
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and a grammatical attribute replacement probability of a grammatical attribute of said example being replaced with 
a grammatical attribute of said first language sentence. 

32. A translating apparatus according to claim 30, wherein said word transformation probability is calculated from a 
word copy probability of a word of said example being used unchanged, a word deletion probability of a word of 
said example being deleted, a word addition probability of a word of said first language sentence being added, and 
a word replacement probability of a word of said example being replaced with a word of said first language sen- 
tence. 

33. A translating apparatus acctxding to claim 29. wherein said word transformation probat>ility is calculated according 
to an optimum transformation path for transforming said example into said first language sentence. 

34. A translating apparatus according to claim 33, wherein said optimum transformation path is found by dynamic pro- 
gramming. 

35. A translating app)aratus according to daim 32, wherein said word transformation protiability is calculated further 
using at least one among a word scramble probability of a word of said example unchanged or replaced with a word 
of said first language sentence being used with its position changed, a word copy and compound probabilrty of a 
word of said example having a modifier added thereto and being used as a compound word, and a word replace 
and compourxJ probability of a word of said example replaced with a word of said first language sentence having a 
modifier added thereto and being used as a compound word, 

36. A translating apparatus according to claim 32, wherein said word transformation probability is calculated according 
to an optimum transformation path for transforming said example into said first language sentence. 

37. A translating apparatus according to daim 36. wherein in finding said optimum transformation path instead of an 
actual deletion path corresponding to actually deleting a word of said example a provisional deletion path provision- 
ally deleting a word of said example is used and instead of an actual addition path corresponding to actually adding 
a word of said first language sentence a provisional addition path provisiortally adding a word of said first language 
sentence is used, 

further comprising word memory means for storing a provisionally deleted wad artd a provisionally added 
word provisionally deleted or added according to said provisional deletion path or said provisional addition path. 

38. A translating apparatus according to daim 37. wherein a saamble path corresponding to using a word of said 
example unchanged or replaced with a word of said first language sentence with its position changed is seleded 
only when either a word of said exanrple has been stored in said word memory means as said provisionally deleted 
word or a word of said first language sentence has been stored in said worxj memory means as said provisionally 
added word, and thereafter said word of said example stored as said provisionally deleted word or said word of said 
syntax unit stored as said provisionally added word is erased from said word memory means, and 

a copy and compound path corresponding to a word of said example having a modifier added thereto and 
being used as a compound word and a replace and compound path corresponding to a word of said exanpte 
replaced wrth a word of said first language senterxie having a nxxJif ier added to it and being used as a compound 
word are selected only when a word to become said modifier has been staed in said word memory means as said 
provisionally added word, and thereafter said word to become a nxxjifier stored as said provisionally added word 
is erased from said word memory means. 

39. A translating apparatus according to daim 38, wherein after the end point of a transformation path transforming 
said exanrple into said first language sentence is reached a provisional deletion path or a provisional addition path 
provisionally deleting or adding said provisionally deleted word or said provisionaily added word stored in said word 
memory means is converted into an actual deleting path or an actual addition path respectively and the path 
obtained is made the optimum transformation path. 

40. A translating method for translating a first language sentence expressed in a first language into a second language 
serrtence expressed in a second language using exanrple memory means (3.5) storing examples expressed in said 
first language and corresponding translations of said exanrples in said second language, conprising the steps of: 

detecting (6) from among said exarrples an exanple similar to said first language sentence on the basis of a 
priori probability of said example being used for translating said first language sentence and a transformation 
probability of said example being transformed into said first language sentence and used: and 
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translating (7) said first language sentence on the basis of a translation among said translations corresponding 
to said example detected. 

41. A translating apparatus for translating a first language sentence expressed in a first language into a second lan- 
guage sentence expressed in a second language, comprising: 

example memory means (3,5) storing examples expressed in said first language and corresponding transla- 
tions of said examples in said second language together with grammatical attributes of said examples: 
extracting means for extracting grammaucal attributes of said first language sentence by carrying out syntax 
analysis of said first language sentence: 

detecting means (6) for detecting among said examples an example similar to said first language sentence tak- 
ing into account grammatical attributes of said first language sentence and of said examples: and 
translating means (7) for translating said first language sentence on the basis of a translation arrwng said 
translations corresponding to said example detected by said detecting means. 

42. A translating method for translating a first language sentence expressed in a first language into a second language 
sentence expressed in a second language using example memory means storing examples expressed in said first 
language and corresponding translations of said examples in said second language and grammatical attributes of 
said examples, corrprising the steps of: 

extracting grammatical attributes of said first language sentence by candying out syntax analysis of said first 
language sentence: 

detecting (6) among said examples an example similar to said first language sentence taking into account 
grammatical attributes of said first language sentence and of saidexamples: and 

translating (7) said first language sentence on the basis of a translation among said translations which corre- 
sporxfs to said example detected. 
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